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POSH AND ASSOCIATED PROTEINS 

BACKGROUND 

Potential drug target validation involves determining whether a DNA, RNA 
or protein molecule is implicated in a disease process and is therefore a suitable 
target for development of new therapeutic drugs. Drug discovery, the process by 
which bioactive compounds are identified and characterized, is a critical step in the 
development of new treatments for human diseases. The landscape of drug 
discovery has changed dramatically due to the genomics revolution. DNA and 
protein sequences are yielding a host of new drug targets and an enormous amount 
of associated information. 

The identification of genes and proteins involved in various disease states or 
key biological processes, such as inflammation and immune response, is a vital part 
of the drug design process. . Many diseases' and disorders could be treated or 
prevented by decreasing the expression of one or more genes involved in the 
molecular etiology of the condition if the appropriate molecular target could be. 
identified and appropriate antagonists developed. For example, cancer, in which one 
or more cellular oncogenes become activated and result in the unchecked 
progression of cell cycle processes, could be treated by antagonizing appropriate cell 
cycle control genes. Furthermore many human genetic diseases, such as 
Huntington's disease, and certain prion conditions, which are influenced by both 
genetic and epigenetic factors, result from the inappropriate activity of a polypeptide 
as opposed to the complete loss of its function. Accordingly, antagonizing the 
aberrant function of such mutant genes would provide a means of treatment. 
Additionally, infectious diseases such as HIV have been successfully treated with 
molecular antagonists targeted to specific essential retroviral proteins such as HIV 
protease or reverse transcriptase. Drug therapy strategies for treating such diseases 
and disorders have frequently employed molecular antagonists which target the 
polypeptide product of the disease gene(s). However the discovery of relevant gene 
or protein targets is often difficult and time consuming. 

One area of particular interest is the identification of host genes and proteins 
that are co-opted by viruses during the viral life cycle. The serious and incurable 

9131718 1 



nature of many viral diseases, coupled with the high rate of mutations found in many 
viruses, makes the identification of antiviral agents a high priority for the 
improvement of world health. Genes and proteins involved in a viral life cycle are 
also appealing as a subject for investigation because such genes and proteins will 
5 typically have additional activities in the host cell and may play a role in other non- 
viral disease states. 

Viral maturation involves the proteolytic processing of the Gag proteins and 
the activity of various host proteins. It is believed that cellular machineries for 
exo/endocytosis and for ubiquitin conjugation may be involved in the maturation. In 
10 particular, the assembly, budding and subsequent release of retroid viruses, RNA 
viruses and envelope viruses, such as various retroviruses, rhabdoviruses, 
lentiviruses, and filoviruses may involve the Gag polyprotein. After its synthesis, 
Gag is targeted to the plasma membrane where it induces budding of nascent virus 
particles. 

15 The role of ubiquitin in virus assembly was suggested by Dunigan et al. 

(1988, Virology 165, 310, Meyers et al. 1991, Virology 180, 602), who observed 
that mature virus particles were enriched in unconjugated ubiquitin. More recently, 
it was shown that proteasome inhibitors suppress the release of HIV-1, HTV-2 and 
virus-like particles derived from SIV and RSV Gag. Also, inhibitors affect Gag 

20 processing and maturation into infectious particles (Schubert et al 2000, PNAS 97, 
13057, Harty et al. 2000, PNAS 97, 13871, Strack et al. 2000, PNAS 97, 13063, 
Patnaik et al. 2000, PNAS 97, 13069). 

It is well known in the art that ubiquitin-mediated proteolysis is the major 
pathway for the selective, controlled degradation of intracellular proteins in 

25 eukaryotic cells. Ubiquitin modification of a variety of protein targets within the 
cell appears to be important in a number of basic cellular functions such as 
regulation of gene expression, regulation of the cell-cycle, modification of cell 
surface receptors, biogenesis of ribosomes, and DNA repair. One major function of 
the ubiquitin-mediated system is to control the half-lives of cellular proteins. The 

30 half-life of different proteins can range from a few minutes to several days, and can 
vary considerably depending on the cell-type, nutritional and environmental 
conditions, as well as the stage of the cell-cycle. 
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Targeted proteins undergoing selective degradation, presumably through the 
actions of a ubiquitin-dependent proteosome, are covalently tagged with ubiquitin 
through the formation of an isopeptide bond between the C-terminal glycyl residue 
of ubiquitin and a specific lysyl residue in the substrate protein. This process is 
5 catalyzed by a ubiqui tin-activating enzyme (El) and a ubiquitin-conjugating enzyme 
(E2), and in some instances may also require auxiliary substrate recognition proteins 
(E3s). Following the linkage of the first ubiquitin chain, additional molecules of 
ubiquitin may be attached to lysine side chains of the previously conjugated moiety 
to form branched multi-ubiquitin chains. 
10 The conjugation of ubiquitin to protein substrates is a multi-step process. In 

an initial ATP requiring step, a thioester is formed between the C-tenninus of 
ubiquitin and an internal cysteine residue of an El enzyme. Activated ubiquitin may 
then be transferred to a specific cysteine on one of several E2 enzymes. Finally, 
these E2 enzymes donate ubiquitin to protein substrates, typically with the assistance 
15 of an E3 protein, also known as a ubiquitin enzyme. In certain instances, substrates 
are recognized directly by the ubiquitin-conjugated E2 enzyme. 

It is also known that the ubiquitin system plays a role in a wide range of 
cellular processes including cell cycle progression, apoptosis, and turnover of many 
membrane receptors. In viral infections, the ubiquitin system is involved not only 
20 with assembly, budding and release, but also with repression of host proteins such as 
p53, which may lead to a viral-induced neoplasm. The HIV Vpu protein interacts 
with an E3 protein that regulates IkB degradation, and is thought to promote 
apoptosis of infected cells by indirectly inhibiting NF-kB activity (Bour et al. (2001) 
J Exp Med 194:1299-311; U.S. Patent No. 5,932,425). The ubiquitin system 
25 regulates protein function by both mono-ubiquitination and poly-ubiquitination, and 
poly-ubiquitination is primarily associated with protein degradation. 

The vesicular trafficking systems are the major pathways for the distribution 
of proteins among cell organelles, the plasma membrane and the extracellular 
medium. The vesicular trafficking systems may be directly or indirectly involved in 
30 a variety of disease states. The major vesicle trafficking systems in eukaryotic cells 
include those systems that are mediated by clathrin-coated vesicles and coatomer- 
coated vesicles. Clathrin-coated vesicles are generally involved in transport, such as 



in the case of receptor mediated endocytosis, between the plasma membrane and the 
early endosomes, as well as from the trans-Golgi network to endosomes. Coatomer- 
coated vesicles include coat protein I (COP-I) coated vesicles and COP-II coated 
vesicles, both of which tend to mediate transport of a variety of molecules between 
5 the ER and Golgi cistemae. In each case, a. vesicle is formed by budding out from a 
portion of membrane that is coated with coat proteins, and the vesicle sheds its coat 
prior to fusing with the target membrane. 

Clathrin coats assemble on the cytoplasmic face of a membrane, forming pits 
that ultimately pinch off to become vesicles. Clathrin itself is composed of two 
10 subunits, the clathrin heavy chain and the clathrin light chain, that form the clathrin 
triskelion. Clathrins associate with a host of other proteins, including the assembly 
protein, API 80, the adaptor complexes (API, AP2, AP3 and AP4), beta-arrestin, 
arrestin 3, auxilin, epsin, Epsl5, v-SNAREs, amphiphysins, dynamin, synaptojanin 
and endophilin. The adaptor complexes promote clathrin cage formation, and help 
15 . connect clathrin up to the membrane, membrane proteins, and many of the preceding 
components. API associates with clathrin coated vesicles derived from the trans- 
Golgi network and contains y, pi, jxl and a I polypeptide chains. AP2 associates 
with endocytic clathrin coated vesicles and contains d, P2, n2, and a2 polypeptides. 
Interactions between the clathrin complex and other proteins are mediated by a 
20 variety of domains found in the complex proteins, such as SH3 (Src homology 3) 
domains, PH (pleckstrin homology) domains, EH domains and NPF domains. 
(Marsh et al. (1999) Science 285:215-20; Pearse et al. (2000) Curr Opin Struct Biol 
10(2):220-8). 

Coatomer-coated vesicle formation is initiated by recruitment of a small 
25 GTPase (eg. ARF or SAR) by its cognate guanine nucleotide excahnge factor (e.g. 
SEC 12, GEA1, GEA2). The initial complex is recognized by a coat protein 
complex (COPI or COPII). The coat then grows across the membrane, and various 
cargo proteins become entrapped in the growing network. The membrane ultimately 
bulges and becomes a vesicle. The coat proteins stimulate the GTPase activity of 
30 the GTPase, and upon hydrolysis of the GTP, the coat proteins are released from the 
complex, uncoating the vesicle. Other proteins associated with coatomer coated 
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vesicles include v-SNAREs, Rab GTPases and various receptors that help recruit the 
appropriate cargo proteins. (Springer et al. (1999) Cell 97:145-48). 

It would be beneficial to identify proteins involved in one or more of these 
processs for use in, among other things, drug screening methods. 

5 

SUMMARY 

In part, the application relates to the ubiquitin ligase, POSH (Plenty Of SH3 
domains), and the discovery of novel interactions between POSH and proteins that 
associate with POSH. By providing novel POSH:POSH-AP interactions, the 
10 application provides, in part, methods for modulating a process that POSH 
participates in by targeting a POSH-AP or the POSH:POSH-AP interaction. 
Furthermore, by providing novel POSH:POSH-AP interactions, the application 
provides, in part, methods for modulating a process that a POSH-AP participates in 
by targeting POSH. As one of skill in the art can readily appreciate, a POSH protein 
15 may form multiple different complexes with POSH-APs, depending on the 
biological context. - i- 

In certain embodiments^ POSH and POSH-associated proteins (POSH-APs) 
play a role in viral maturation. Optionally, POSH and POSH-APs act in the 
assembly or trafficking of complexes that mediate viral release. In one embodiment, 
20 POSH polypeptides and POSH-APs may stimulate ubiquitylation of certain proteins 
or stimulate membrane fusion or both. In certain embodiments, POSH and POSH- 
APs participate in vesicular trafficking. In certain embodiments, POSH and POSH- 
APs regulate a Rac signaling pathway. In certain embodiments, POSH and POSH- 
APs regulate a JNK signaling pathway. In certain embodiments, POSH and POSH- 
25 APs regulate NF-kB and NF-kB signaling. In certain embodiments, POSH and 
POSH-APs participate in autoregulation of POSH polypeptide levels. In certain 
embodiments, POSH and POSH-APs regulate apoptosis. In certain embodiments, 
POSH and POSH-APs participate in cellular positioning of the nucleus. In certain 
embodiments, POSH and POSH-APs participate in attachment of the nuclear 
30 membrane to the actin cytoskeleton. In certain embodiments, POSH and POSH-APs 
participate in cellular responses to homocysteine, including the unfolded protein 
response (UPR), thromboembolic vascular disease, apoptosis of vascular endothelial 
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cells, dysregulation of sterol synthesis (e.g. cholesterol synthesis) positioning of the 
nucleus. 

In some aspects, the application provides POSH and POSH-AP nucleic acid 
sequences and proteins encoded thereby, as well as oligonucleotides derived from 

5 the nucleic acid sequences, antibodies directed to the encoded proteins, screening 
assays to identify agents that modulate POSH, POSH-APs and/or biological events 
affected by POSH and POSH-APs. In certain aspects the application provides 
diagnostic methods for detecting cells infected with a virus, preferably an envelope 
virus, an RNA virus and particulalry a retroid virus. 

10 In one aspect, the application provides an isolated nucleic acid comprising a 

nucleotide sequence which hybridizes under stringent conditions to a sequence of 
SEQ ID NOs: 1, 3, 4, 6, 8 and/or 10 or a sequence complementary thereto. In a 
related embodiment, the nucleic acid is at least about 80%, 90%, 95%, or 97-98%, 
or 100% identical to a sequence corresponding to at least about 12, at least about 15, 

15 at least about 25, at least about 40, at least about 100, at least about 300, at least 
about 500, at least about 1000, or at least about 2500 consecutive nucleotides up to 
the full length of SEQ ID NO: 1, 3, 4, 6, 8 and/or 10, or* sequence complementary 
thereto. 

In one aspect, the application provides an isolated nucleic acid comprising a 
20 nucleotide sequence which hybridizes under stringent conditions to a sequence of 
SEQ ID NOs: 31-35 or a sequence complementary thereto. In a related 
embodiment, the nucleic acid is at least about 80%, 90%, 95%, or 97-98%, or 100% 
identical to a sequence corresponding to at least about 12, at least about 15, at least 
about 25, consecutive nucleotides up to the full length of SEQ ID NO: 31-35, or a 
25 sequence complementary thereto. 

In other embodiments, the application provides a nucleic acid comprising a 
nucleotide sequence which hybridizes under stringent conditions to a sequence of 
SEQ ID Nos. 1, 3, 4, 6, 8 and/or 10, or a nucleotide sequence that is at least about 
80%, 90%, 95%, or 97-98%, or 100% identical to a sequence corresponding to at 
30 least about 12, at least about 15, at least about 25, at least about 40, at least about 
100, at least about 300, at least about 500, at least about 1000, or at least about 2500 
consecutive nucleotides up to the full length of SEQ ID NO: 1, 3, 4, 6, 8 and/or 10, 
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or a sequence complementary thereto, and a transcriptional regulatory sequence 
operably linked to the nucleotide sequence to render the nucleotide sequence 
suitable for use as an expression vector. In another embodiment, the nucleic acid 
may be included in an expression vector capable of replicating in a prokaryotic or 
5 eukaryotic cell. In a related embodiment, the application provides a host cell 
transfected with the expression vector. 

In yet another embodiment, the application provides a substantially pure 
nucleic acid which hybridizes under stringent conditions to a nucleic acid probe 
corresponding to at least about 12, at least about 15, at least about 25, or at least 
10 about 40 consecutive nucleotides up to the full length of SEQ ID NO:l, 3, 4, 6, 8 
and/or 10, or a sequence complementary thereto or up to the full length of the gene 
of which said sequence is a fragment The application also provides an antisense 
oligonucleotide analog which hybridizes under stringent conditions to at least 12, at 
least 25, or at least 50 consecutive nucleotides up to the full length of SEQ ID NO:l 
1 5 and/or 3, or a sequence complementary thereto. • 

In a further embodiment, the application provides a nucleic acid comprising 
a nucleic acid encoding an amino acid sequence as set forth in any of SEQ ID Nos: 
2, 5, 7, 9 or 11, or a nucleic acid complement thereof. In a related embodiment, the 
encoded amino acid sequence is at least about 80%, 90%, 95%, or 97-98%, or 100% 
20 identical to a sequence corresponding to at least about 12, at least about 15, at least 
about 25, or at least about 40, at least about 100, at least about 200, at least about 
300, at least about 400 or at least about 500 consecutive amino acids up to the full 
length of any of SEQ ID Nos:2, 5, 7, 9 or 1 1. 

In another embodiment, the application provides a probe/primer comprising 
25 a substantially purified oligonucleotide, said oligonucleotide containing a region of 
nucleotide sequence which hybridizes under stringent conditions to at least about 12, 
at least about 15, at least about 25, or at least about 40 consecutive nucleotides of 
sense or antisense sequence selected from SEQ ID Nos: 1, 3, 4, 6, 8 and/or 10, or a 
sequence complementary thereto. In preferred embodiments, the probe selectively 
30 hybridizes with a target nucleic acid. In another embodiment, the probe may include 
a label group attached thereto and able to be detected. The label group may be 
selected from radioisotopes, fluorescent compounds, enzymes, and enzyme co- 
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factors. The application further provides arrays of at least about 10, at least about 25, 
at least about 50, or at least about 100 different probes as described above attached 
to a solid support. 

In another aspect, the application provides polypeptides. In one embodiment, 
5 the application pertains to a polypeptide including an amino acid sequence encoded 
by a nucleic acid comprising a nucleotide sequence which hybridizes under stringent 
conditions to a sequence of SEQ ID Nos:l, 3, 4, 6, 8 and/or 10, or a sequence 
complementary thereto, or a fragment comprising at least about 25, or at least about 
40 amino acids thereof. 
10 In a preferred embodiment, the POSH polypeptide comprises a sequence that 

is identical with or homologous to any of SEQ ID Nos: 2, 5, 7, 9 or 11. For 
instance, a POSH polypeptide preferably has an amino acid sequence at least 60% 
homologous to a polypeptide represented by any of SEQ ID Nos:2, 5, 7, 9 or 1 1 and 
polypeptides with higher sequence homologies of, for example, 80%, 90% or 95% 
15 are also contemplated. The POSH polypeptide can comprise a full length protein, 
such as represented in the sequence listings, or it can comprise a fragment of, for 
instance, at least 5, 10, 20, 50, 100, 150, 200, 250, 300, 400 or 500 or more amino 
acids in length. 

In another embodiment, the application provides polypeptides comprising a 
20 sequence that is at least 80%, 90% or 95% identical with or homologous to any of 
SEQ ID Nos: 26-30. 

In another preferred embodiment, the application features a purified or 
recombinant polypeptide fragment of a POSH polypeptide, which polypeptide has 
the ability to modulate, e.g., mimic or antagonize, an activity of a wild-type POSH 
25 protein. Preferably, the polypeptide fragment comprises a sequence identical or 
homologous to an amino acid sequence designated in any of SEQ ID Nos: 2, 5, 7, 9 
or 11. 

Moreover, as described below, the POSH polypeptide can be either an 
agonist (e.g. mimics), or alternatively, an antagonist of a biological activity of a 
30 naturally occurring form of the protein, e.g., the polypeptide is able to modulate the 
intrinsic biological activity of a POSH protein or a POSH complex, such as an 
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enzymatic activity, binding to other cellular components, cellular 
compartmentalization, membrane reorganization and the like. 

The subject proteins can also be provided as chimeric molecules, such as in 
the form of fusion proteins. For instance, the POSH polypeptide can be provided as 
5 a recombinant fusion protein which includes a second polypeptide portion, e.g., a 
second polypeptide having an amino acid sequence unrelated (heterologous) to 
POSH, e.g. the second polypeptide portion is glutathione-S-transferase, e.g. the 
second polypeptide portion is an enzymatic activity such as alkaline phosphatase, 
e.g. the second polypeptide portion is an epitope tag, etc. 
10 Yet another aspect of the present application concerns an immunogen 

comprising a POSH polypeptide in an immunogenic preparation, the immunogen 
being capable of eliciting an immune response specific for the POSH polypeptide; 
e.g. a humoral response, e.g. an antibody response; e.g. a cellular response. In 
preferred embodiments, the immunogen comprises an antigenic determinant, e.g. a 
1 5 unique determinant, from a protein represented by SEQ ID NO:2. 

In yet another aspect, this application provides antibodies immunoreactive 
with one or more POSH polypeptides. In one embodiment, antibodies are specific 
for an SH3 domain or a RING domain derived from a POSH polypeptide. In a more 
specific embodiment, the domain is part of an amino acid sequence set forth in SEQ 
20 ID NO:2. In a set of exemplary embodiments, an antibody binds to one or more 
SH3 domains represented by amino acids 137-192, 199-258, 448-505 and 832-888 
of SEQ ID NO:2 and are set forth in any one of SEQ ID Nos: 27-20. In another 
exemplary embodiment, an antibody binds to a RING domain represented by amino 
acids 12-52 of SEQ ID NO:2 and is set forth in SEQ ID No: 26. In another 
25 embodiment, the antibodies are immunoreactive with one or more proteins having 
an amino acid sequence that is at least 80% identical, at least 90% identical or at 
least 95% identical to an amino acid sequence as set forth in SEQ ID NO:2. In other 
embodiments, an antibody is immunoreactive with one or more proteins having an 
amino acid sequence that is 85%, 90%, 95%, 98%, 99% or identical to an amino 
30 \ acid sequence as set forth in SEQ ID NO:2. 

In certain embodiments, the subject POSH nucleic acids will include a 
transcriptional regulatory sequence, e.g. at least one of a transcriptional promoter or 
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transcriptional enhancer sequence, which regulatory sequence is operably linked to 
the POSH sequence. Such regulatory sequences can be used to render the POSH 
sequence suitable for use as an expression vector. 

In yet another aspect, the application provides an assay for screening test 

5 compounds for inhibitors, or alternatively, potentiators, of an interaction between a 
POSH polypeptide and a POSH-AP. An exemplary method includes the steps of (i) 
combining POSH-AP, a POSH polypeptide, and a test compound, e.g., under 
conditions wherein, but for the test compound, the POSH polypeptide and POSH- 
AP are able to interact; and (ii) detecting the formation of a complex which includes 

10 the POSH polypeptide and a POSH-AP. A statistically significant change, such as a 
decrease, in the formation of the complex in the presence of a test compound 
(relative to what is seen in the absence of the test compound) is indicative of a 
modulation, e.g., inhibition, of the interaction between the POSH polypeptide and 
POSH-AP. 

15 In a further embodiment, the application provides an assay for identifying a 

test compound Which inhibits or potentiates the interaction of a POSH polypeptide 
to a POSH-AP, comprising (a) forming a reaction mixture including POSH 
polypeptide, a POSH-AP; and a test compound; and detecting binding of said POSH 
polypeptide to said POSH-AP; wherein a change in the binding of said POSH 

20 polypeptide to said POSH-AP in the presence of the test compound, relative to 
binding in the absence of the test compound, indicates that said test compound 
potentiates or inhibits binding of said POSH polypeptide to said POSH-AP. 

In additional embodiment, the application relates to a method for identifying 
modulators of protein complexes, comprising (a) forming a reaction mixture 

25 comprising a POSH polypeptide, a POSH-AP; and a test compound; (b) contacting 
the reaction mixture with a test agent, and (c) determining the effect of the test agent 
for one or more activities. Exemplary activities include a change in the level of the 
protein complex, a change m the enzymatic activity of the complex, where the 
reaction mixture is a whole cell, a change in the plasma membrane localization of 

30 the complex or a component thereof or a change in the interaction between the 
POSH polypeptide and the POSH-AP. 



- 10- 



An additional embodiment is a screening assay to identify agents that inhibit 
or potentiate the interaction of a POSH polypeptide and a POSH-AP, comprising 
providing a two-hybrid assay system including a first fusion protein comprising a 
POSH polypeptide portion of SEQ ID NO:2, and a second fusion protein comprising 
5 a POSH-AP portion, under conditions wherein said two hybrid assay is sensitive to 
interactions between the POSH polypeptide portion of said first fusion protein and 
said POSH-AP portion of said second polypeptide; measuring a level of interactions 
between said fusion proteins in the presence and in the absence of a test agent; and 
comparing the level of interaction of said fusion proteins, wherein a decrease in the 

10 level of interaction is indicative of an agent that will inhibit the interaction between 
a POSH polypeptide and a POSH-AP. 

In additional aspects, the application provides isolated protein complexes 
including a combination of a POSH polypeptide and at least one POSH-AP. In 
certain embodiments, a POSH complex is related to clathrin-coated vesicle 

15 formation. In a further embodiment, a POSH complex comprises a viral protein, 
such as Gag. In certain embodiments, a POSH complex relates to a ubiquitin related 
activity of POSH, as in the case of POSH complexes comprising ubiquitin (e.g., 
covalent or non-covalent POSH ubiquitin conjugates), an E2, an El or a 
ubiquitination target. In certain embodiments, a POSH complex relates to JNK 

20 signaling, as in the case of POSH complexes comprising a Rac, an MLK, an MKK 
and/or a JNK* In certain embodiments, a POSH complex is part of a cellular 
response to homocysteine, as in the case of a POSH complex comprising a 
HERPUD1 . In certain embodiments, a POSH complex relates to nuclear membrane 
positioning and/or anchoring, as in the case of a POSH complex comprising an 

25 UNC84 polypeptide. 

In an additional aspect, the application provides nucleic acid therapies for 
manipulating POSH. In one embodiment, the application provides a ribonucleic 
acid comprising between 5 and 1000 consecutive nucleotides of a nucleic acid 
sequence that is at least 90%, 95%, 98%, 99% or optionally 100% identical to a 

30 sequence of SEQ ID NO:l and/or 3 or a complement thereof. Optionally the 
ribonucleic acid comprises at least 10, 15, 20, 25, or 30 consecutive nucleotides, and 
no more than 1000, 750, 500 and 250 consecutive nucleotides of a POSH nucleic 
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acid. In certain embodiments the ribonucleic acid is an RNAi oligomer or a 
ribozyme. Preferably, the ribonucleic acid decreases the level of a POSH mRNA. 
Preferred ribonucleic acids comprise a sequence selected from any of SEQ ID Nos: 
15, 16, 18, 19, 21, 22, 24 and 25. 

5 The application also features transgenic non-human animals, e.g. mice, rats, 

rabbits, goats, sheep, dogs, cats, cows, or non-human primates, having a transgene, 
e.g., animals which include (and preferably express) a heterologous form of the 
POSH gene described herein. Such a transgenic animal can serve as an animal 
model for studying viral infections such as HIV infection or for use in drug 

1 0 screening for viral infections. 

In further aspects, the application provides compositions for the delivery of a 
nucleic acid therapy, such as, for example, compositions comprising a liposome 
and/or a pharmaceutically acceptable excipient or carrier. 

In further aspects, the application provides an isolated, purified or 

15 recombinant complex comprising a POSH polypeptide and a POSH-AP. In certain 
«. .. embodiments, the complex comprises a POSH-AP that interacts with a POSH 
polypeptide in a yeast two-hybrid assay. In certain aspects a POSH-AP is selected 
from the group consisting of: UNC48B, MSTP028 and HERPUD1 (or portion 
thereof sufficient for POSH interaction). In certain embodiments, the complex 

20 comprises a POSH-AP that co-immunoprecipates with a POSH polypeptide. In 
certain aspects a POSH-AP is selected from the group consisting of: dynamin II, 
GroEL, an HSP70, an HSC70, a cytokeratin I, a keratin type II subunit, a cytokeratin 
10, a ubiquitin-specific protease (hydrolase) and a Gag polypeptide (such as a a 
Gag-Pol polypeptide). In certain embodiments, a POSH-AP is selected from the 

25 group consisting of MLK1, MLK2, MLK3, MKK4, MKK7, JNK1 and JNK2. In 
certain embodiments, the POSH polypeptide is a POSH RING domain, such as the 
RING domain of SEQ ID NO:26 or a polypeptide at least 90% identical to SEQ ID 
NO:26. In certain preferred embodiment, the complex comprises a POSH RING 
domain and a polypeptide selected from the group consisting of: an HSP70-8, a Gag 

30 polypeptide (such as a Gag-Pol polypeptide) and ubiquitin-specific protease 
(hydrolase). In certain embodiments, the POSH polypeptide is a POSH SH3 
domain, such as the SH3 4 domain of SEQ ID NO:30 or a polypeptide at least 90% 
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identical to SEQ ID NO:30. In certain embodiments, a complex comprises a POSH 
polypeptide lacking a RING domain and a polypeptide selected from the group 
consisting of: UNC48B, MSTP028 and HERPUD1. In certain aspects, the 
application provides methods for identifying a test agent having antiviral or anti- 
5 apoptotic activities by identifying a test agent that disrupts a complex described 
above. 

In certain aspects, the application provides for identifying an agent with 
antiviral activity, the method comprising: a) forming a mixture comprising a test 
agent, a POSH polypeptide and a Gag polypeptide under conditions that, but for the 

10 test agent, permit POSH-mediated ubiquitination of the Gag polypeptide; and b) 
detecting POSH-mediated ubiquitination of the Gag polypeptide, wherein a decrease 
in POSH-mediated ubiquitination of the Gag polypeptide is indicative of an agent 
having antiviral or antiapoptotic activity. In preferred embodiments, the Gag 
polypeptide is a Gag-Pol polypeptide, and particularly an HIV pi 60 Gag-Pol 

15 ^polypeptide. 

In certain aspects, the application provides methods for identifying an 
antiviral or antiapoptotic agent comprising: a) providing a POSH-AP polypeptide 
and a test agent; and b) identifying a test agent that binds to the POSH-AP 
polypeptide. In certain aspects the method comprises a) contacting a POSH-AP 

20 polypeptide with a test agent, and b) identifying a test agent that modulates an 
activity of the POSH-AP. Preferred POSH-APs for use in such a method include 
UNC84B, MSTP028 and HERPUD1. 

In certain aspects the application provides an isolated antibody, or fragment 
thereof, specifically immunoreactive with an epitope of a sequence selected from the 

25 group consisting of SEQ ID NO: 2, SEQ ID No: 26, and SEQ ID NO:30, which 
antibody disrupts the interaction between a polypeptide of SEQ ID NO: 2 and a 
POSH-AP. In a preferred embodiment, the antibody or fragment thereof disrupts the 
interaction between a POSH domain and a POSH-AP selected from the group 
consisting of: UNC84B, MSTP028 and HERPUD1. 

30 In certain aspects, the application provides methods of inhibiting viral 

infections comprising administering an agent to a subject in need thereof wherein 
said agent inhibits the interaction between a POSH polypeptide and a POSH-AP. In 
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certain preferred embodiments, the POSH-AP is a Gag polypeptide, particularly a 
Gag-Pol polypeptide such as the HIV pi 60 Gag-Pol polypeptide. Optionally, the 
infection is an HIV infection. In certain aspects the application provides methods of 
inhibiting viral infection comprising administering an agent to a subject in need 
5 thereof wherein said agent inhibits the POSH-mediated ubiquitination of a Gag 
polypeptide, particularly a Gag-Pol polypeptide such as the HIV pi 60 Gag-Pol 
polypeptide. 

The practice of the present application will employ, unless otherwise 
indicated, conventional techniques of cell biology, cell culture, molecular biology, 
10 transgenic biology, microbiology, recombinant DNA, and immunology, which are 
within the skill of the art Such techniques are explained fully in the literature. See, 
for example, Molecular Cloning A Laboratory Manual, 2nd Ed., ed. by Sambrook, 
Fritsch and Maniatis (Cold Spring Harbor Laboratory Press: 1989); DNA Cloning, 
Volumes I and H (D. N. Glover ed., 1985); Oligonucleotide Synthesis (M. J. Gait 
15 ed., 1984); Mullis et al. U.S. Patent No: 4,683,195; Nucleic Acid Hybridization (B. 
D. Hames & S. J. Higgins eds. 1984); Transcription And Translation (B. D. Haines 
. & S. J. Higgins eds. 1984); Culture Of Animal Cells (R. I. Freshney, Alan R. liss, 
Inc., 1987); Immobilized Cells And Enzymes (IRL Press, 1986); B. Perbal, A 
Practical Guide To Molecular Cloning (1984); the treatise, Methods In Enzymology 
20 (Academic Press, Inc., N.Y.); Gene Transfer Vectors For Mammalian Cells (J. H. 
Miller and M. P. Calos eds., 1987, Cold Spring Haibor Laboratory); Methods In 
Enzymology, Vols. 154 and 155 (Wu et al. eds.), Immunochemical Methods In Cell 
And Molecular Biology (Mayer and Walker, eds., Academic Press, London, 1987); 
Handbook Of Experimental Immunology, Volumes I-IV (D. M. Weir and C. C. 
25 Blackwell, eds., 1986); Manipulating the Mouse Embryo, (Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, N.Y., 1986). 

Other features and advantages of the application will be apparent from the 
following detailed description, and from the claims. 

30 BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1: Human POSH Coding Sequence (SEQ ID NO:l) 
Figure 2: Human POSH Amino Acid Sequence (SEQ ID NO:2) 
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Figure 3: Human POSH cDNA Sequence (SEQ ID NO:3) 

Figure 4: 5' cDNA fragment of human POSH (public gi:10432611; SEQ ID NO:4) 
Figure 5: N terminus protein fragment of hPOSH (public gi:10432612; SEQ ID 
NO:5) 

5 Figure 6: 3* mRNA fragment of hPOSH (public gi:7959248; SEQ ID NO:6) 

Figure 7: C terminus protein fragment of hPOSH (public gi:7959249; SEQ ID 
NO:7) 

Figure 8: Human POSH full mRNA, annotated sequence 
Figure 9: Domain analysis of human POSH 
10 Figure 10: Diagram of human. POSH nucleic acids. The diagram shows the full- 
length POSH gene and the position of regions amplified by RT-PCR or targeted by 
siRNA used in figure 1 1 . 

Figure 11: Knockdown of POSH mRNA by siRNA duplexes. HeLa SS-6 cells were 
transfected with siRNA against Lamin A/C (lanes 1, 2) or POSH (lanes 3-10). 

15 POSH siRNA was directed against the coding region (153 - lanes 3,4; 155 - lanes » 
.5,6) or the 3'UTR (157 - lanes 7, 8; 159 - lanes 9, 10). Cells were harvested 24 hours 
post-transfection, RNA extracted, and POSH mRNA levels compared by RT-PCR of 
a discrete sequence in the coding region of the POSH gene (see figure 10). GAPDH 
is used an RT-PCR control in each reaction. 

20 Figure 12: POSH affects the release of VLP from cells. A) Phosphohimages of SDS- 
PAGE gels of immunoprecipitations of 35S pulse-chase labeled Gag proteins are 
presented for cell and viral lysates from transfected HeLa cells that were either 
untreated or treated with POSH RNAi (50nM for 48 hours). The time during the 
chase period (1,2,3,4 and 5 hours after the pulse) are presented from left to right for 

25 each image. 

Figure 13: Release of VLP from cells at steady state. Hela cells were transfected 
with an HIV-encoding plasmid and siRNA. Lanes 1 , 3 and 4 were transfected with 
wild-type HIV-encoding plasmid. Lane 2 was transfected with an HIV-encoding 
plasmids which contains a point mutation in p6 (PTAP to ATAP). Control siRNA 
30 (lamin A/C) was transfected to cells in lanes I and 2. siRNA to TsglOl was 
transfected in lane 4 and siRNA to POSH in lane 3. 

Figure 14: Mouse POSH mRNA sequence (public gi: 1 0946921; SEQ ID NO: 8) 
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Figure 15: Mouse POSH Protein sequence (Public gi: 10946922; SEQ ID NO: 9) 
Figure 16: Drosophila melanogaster POSH mRNA sequence (public gi: 17737480; 
SEQ ED NO: 10) 

Figure 17: Drosophila melanogaster POSH protein sequence (public gi:17737481; 
5 SEQIDNO:ll) 

Figure 18: POSH Domain Analysis 

Figure 19: Partial knockdown of human POSH results in four logs reduction of 
HIV1 infectivity. The results from infectivity assay are presented are presented in 
the diagram. The vertical axis shows the percentage of target cells infected, and the 
10 horizontal axis shows the fold dilution of virus stocks used (see Example 4 for 
details of the experiment). The open squares (top line) indicate the results from the 
control, and the closed squares (bottom line) indicate the results from transfecring 
cells with RNAi to POSH. 

Figure 20: Human POSH protein mediates both PTAP- and PPEY-mediated viral 
15 particle release in the context of HTV. Phosphohimages of SDS-PAGE gels of 
immunoprecipitations of 35S pulse-chase labeled Gag proteins are presented for cell 
and viral lysates from transfected HeLa cells. Hela cells were transfected with an 
HIV-encoding plasmid and siRNA. From left to right: the left panel was transfected 
with wild-type HIV-encoding plasmid. The middle panel was transfected with an 
20 HIV-encoding plasmids which contains a point mutation (PPEE to PPEY). The right 
panel was transfected with an HIV-encoding plasmids which contains a point 
mutation in p6 (PTAP to ATAP). From top to bottom: control siRNA (lamin A/C) 
was transfected to cells in the top panel. siRNA to TsglOl was transfected to cells in 
the middle panel and siRNA to POSH was transfected to cells in the bottom panel. 
25 The time during the chase period (0, 1 , 2, 4 and 8 hours after the pulse) are presented 
from left to right for each panel. 

Figure 21: Deletion of the RING domain and the 4 th SH3 domain of human POSH 
protein inhibits HIV1 viral particle release. 
Figure 22: Human POSH has ubiquitin ligase activity 
30 Figure 23 : Human POSH co-immunoprecipitates with RAC 1 

Figure 24: Knock-down of human POSH entraps HIV virus particles in intracellular 
vesicles. HTV virus release was analyzed by electron microscopy following siRNA 



- 16- 



and fiill-length HIV plasmid transfection. Mature viruses were secreted by cells 
transfected with HTV plasmid and non-relevant siRNA (control, bottom panel). 
Knockdown of TsglOl protein resulted in a budding defect, the viruses that were 
released had an immature phenotype (top panel). Knockdown of hPOSH levels 
5 resulted in accumulation of viruses inside the cell in intracellular vesicles (middle 
panel). 

Figure 25: Human POSH is localized at two sites: one at the nucleus and one at the 
golgi (C)) . After HIV transfection POSH golgi localization is enhanced(D). HeLa 
and 293T cells were transfected with pNLenv-1. 24 hours post transfection the cells 

10 were incubated with primary antibodies against POSH and either with anti-p24 Gag 
(B) or an organelle marker as follows: BiP- endoplasmatic reticulum (data not 
shown), GM130- golgi, (C,D) nucleoporin- nuclei matrix (E, F) and histone Hi- 
nucleus (data not shown). Two different flourescence labeled secondary antibodies 
were used. Specific signals were obtained by laser-scanning confocal microscopy, 

15 and individual signals were overlaid to assess their relative positions. POSH is 
localized to the nucleus (A) and partially colocalized with HTV-1 Gag and the golgi 
marker outside the nucleus, presumably in the nuclear matrix (B) following 
transfection. 

Figures 26 and 27 show a schematic protocol and results, respectively, for 
20 identifying proteins that are ubiquitinated through a POSH-mediated process. 

DETAILED DESCRIPTION OF THE APPLICATION 
1. Definitions 

The term "binding" refers to a direct association between two molecules, due 
25 to, for example, covalent, electrostatic, hydrophobic, ionic and/or hydrogen-bond 
interactions under physiological conditions. 

A "chimeric protein" or "fusion protein" is a fusion of a first amino acid 
sequence encoding a polypeptide with a second amino acid sequence defining a 
domain foreign to and not substantially homologous with any domain of the first 
30 amino acid sequence. A chimeric protein may present a foreign domain which is 
found (albeit in a different protein) in an organism which also expresses the first 
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protein, or it may be an "interspecies", "intergenic", etc. fusion of protein structures 
expressed by different kinds of organisms. 

The terms "compound", "test compound" and 4 *molecule" are used herein 
interchangeably and are meant to include, but are not limited to, peptides, nucleic 

5 acids, carbohydrates, small organic molecules, natural product extract libraries, and 
any other molecules (including, but not limited to, chemicals, metals and 
organometallic compounds). 

The phrase "conservative amino acid substitution" refers to grouping of 
amino acids on the basis of certain common properties. A functional way to define 

10 common properties between individual amino acids is to analyze the normalized 
frequencies of amino acid changes between corresponding proteins of homologous 
organisms (Schulz, G. E. and R. H. Schirmer., Principles of Protein Structure, 
Springer-Verlag). According to such analyses, groups of amino acids may be 
defined where amino acids within a group exchange preferentially with each other, 

15 and therefore resemble each other most in their impact on the overall protein 
structure (Schulz, G. E. and R. H. Schirmer., Principles of Protein Structure, 
Springer-Verlag). Examples of amino acid groups defined in this manner include: 

(i) a charged group, consisting of Glu and Asp, Lys, Arg and His, 

(ii) a positively-charged group, consisting of Lys, Arg and His, 
20 (iii) a negatively-charged group, consisting of Glu and Asp, 

(iv) an aromatic group, consisting of Phe, Tyr and Trp, 

(v) a nitrogen ring group, consisting of His and Trp, 

(vi) a large aliphatic nonpolar group, consisting of Val, Leu and lie, 

(vii) a slightly-polar group, consisting of Met and Cys, 

25 (viii) a small-residue group, consisting of Ser, Thr, Asp, Asn, Gly, Ala, Glu, Gin 
and Pro, 

(ix) an aliphatic group consisting of Val, Leu, He, Met and Cys, and 

(x) a small hydroxyl group consisting of Ser and Thr. 

In addition to the groups presented above, each amino acid residue may form 
30 its own group, and the group formed by an individual amino acid may be referred to 
simply by the one and/or three letter abbreviation for that amino acid commonly 
used in the art. 
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A "conserved residue" is an amino acid that is relatively invariant across a 
range of similar proteins. Often conserved residues will vary only by being replaced 
with a similar amino acid, as described above for "conservative amino acid 
substitution". 

5 The term "domain" as used herein refers to a region of a protein that 

comprises a particular structure and/or performs a particular function. 

The term "envelope virus" as used herein refers to any virus that uses cellular 
membrane and/or any organelle membrane in the viral release process. 

"Homology" or "identity" or "similarity"* refers to sequence similarity 
10 between two peptides or between two nucleic acid molecules. Homology and 
identity can each be determined by comparing a position in each sequence which 
may be aligned for purposes of comparison. When an equivalent position in the 
compared sequences is occupied by the same base or amino acid, then the molecules 
are identical at that position; when the equivalent site occupied by the same or a 
15 similar amino acid residue (e.g., similar in steric and/or electronic nature), then the 
molecules can be referred to as homologous (similar) at that position. Expression as 
a percentage of homology/similarity or identity, refers to a function of the number of 
identical or similar amino acids at positions shared by the compared sequences. A 
sequence which is "unrelated" or "non-homologous" shares less than 40% identity, 
20 though preferably less than 25% identity with a sequence of the present application. 
In comparing two sequences, the absence of residues (amino acids or nucleic acids) 
or presence of extra residues also decreases the identity and homology/similarity. 

The term "homology" describes a mathematically based comparison of 
sequence similarities which is used to identify genes or proteins with similar 
25 functions or motifs. The nucleic acid and protein sequences of the present 
application may be used as a "query sequence" to perform a search against public 
databases to, for example, identify other family members, related sequences or 
homologs. Such searches can be performed using the NBLAST and XBLAST 
programs (version 2.0) of Altschul, et al. (1990) J Mol. Biol. 215:403-10. BLAST 
30 nucleotide searches can be performed with the NBLAST program, score=100, 
wordlength=12 to obtain nucleotide sequences homologous to nucleic acid 
molecules of the application. BLAST protein searches can be performed with the 
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XBLAST program, score=50, wordlength=3 to obtain amino acid sequences 
homologous to protein molecules of the application. To obtain gapped alignments 
for comparison purposes, Gapped BLAST can be utilized as described in Altschul et 
aL, (1997) Nucleic Acids Res. 25(17):3389-3402. When utilizing BLAST and 

5 Gapped BLAST programs, the default parameters of the respective programs (e.g., 
XBLAST and BLAST) can be used. See http://www.ncbi.nlm.nih.gov. 

As used herein, "identity" means the percentage of identical nucleotide or 
amino acid residues at corresponding positions in two or more sequences when the 
sequences are aligned to maximize sequence matching, i.e., taking into account gaps 

10 and insertions. Identity can be readily calculated by known methods, including but 
not limited to those described in (Computational Molecular Biology, Lesk, A. M., 
ed., Oxford University Press, New York, 1988; Biocotnputing: Informatics and 
Genome Projects, Smith, D. W., ed., Academic Press, New York, 1993; Computer 
Analysis of Sequence Data, Part I, Griffin, A. M., and Griffin, H. G., eds., Humana 
. 15 Press, New Jersey, 1994; Sequence Analysis in Molecular Biology, von Heinje, G., 
Academic Press, 1987; and Sequence Analysis Primer, Gribskov, M. and Devereux, 
J., eds., M Stockton Press, New York, 1991; and Carillo, H., and Lipman, D., SLAM 
J. Applied Math., 48: 1073 (1988). Methods to determine identity are designed to 
give the largest match between the sequences tested. Moreover, methods to 

20 determine identity are codified in publicly available computer programs. Computer 
program methods to determine identity between two sequences include, but are not 
limited to, the GCG program package (Devereux, J., et aL, Nucleic Acids Research 
12(1): 387 (1984)), BLASTP, BLASTN, and FASTA (Altschul, S. F. et aL, J. 
Molec. Biol. 215: 403-410 (1990) and Altschul et aL Nuc. Acids Res. 25: 3389-3402 

25 (1 997)). The BLAST X program is publicly available from NCBI and other sources 
(BLAST Manual, Altschul, S., et aL, NCBI NLM NIH Bethesda, Md. 20894; 
Altschul, S., et aL, J. Mol. BioL 215: 403-410 (1990). The well known Smith 
Waterman algorithm may also be used to determine identity. 

The term "intron" refers to a portion of nucleic acid that is intially 

30 transcribed into RNA but later removed such that it is not, for the most part, 
represented in the processed mRNA. Intron removal occurs through reactions at the 
5* and 3* ends, typically referred to as 5' and 3' splice sites, respectively. Alternate 
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use of different splice sites results in splice variants. An intron is not necessarily 
situated between two "exons", or portions that code for amino acids, but may instead 
be positioned, for example, between the promoter and the first exon. An intron may 
be self-splicing or may require cellular components to be spliced out of the mRNA. 
A "heterologous intron" is an intron that is inserted into a coding sequence that is 
not naturally associated with that coding sequence. In addition, a heterologous 
intron may be a genrally natural intron wherein one or both of the splice sites have 
been altered to provide a desired quality, such as increased or descreased splice 
efficiency. Heterologous introns are often inserted, for example, to improve 
expression of a gene in a heterologous host, or to increase the production of one 
splice variant relative to another. As an example, the rabbit beta-gtobin gene may be 
used, and is commercially available on the pCI vector from Promega Inc. Other 
exemplary introns are provided in Lacy-Hulbert et al. (2001) Gene Ther 8(8):649- 
53. 

The term "isolated", as used herein with reference to the subject proteins and 
protein complexes, refers to a preparation of protein or protein complex that is 
essentially free from contaminating proteins that normally would be present with the 
protein or complex, e.g., in the cellular milieu in which the protein or complex is 
found endogenously. Thus, an isolated protein complex is isolated from cellular 
components that normally would "contaminate" or interfere with the study of the 
complex in isolation, for instance while screening for modulators thereof. It is to be 
understood, however, that such an "isolated" complex may incorporate other 
proteins the modulation of which, by the subject protein or protein complex, is being 
investigated. 

The term "isolated" as also used herein with respect to nucleic acids, such as 
DNA or RNA, refers to molecules in a form which does not occur in nature. 
Moreover, an "isolated nucleic acid" is meant to include nucleic acid fragments 
which are not naturally occurring as fragments and would not be found in the natural 
state. 

Lentiviruses include primate Antiviruses, e.g., human immunodeficiency 
virus types 1 and 2 (HIV-l/HIV-2); simian immunodeficiency virus (SIV) from 
Chimpanzee (SIVcpz), Sooty mangabey (SIVsmm), African Green Monkey 
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(SIVagm), Syke's monkey (SIVsyk), Mandrill (SIVmnd) and Macaque (SWmac). 
Lentiviruses also include feline lentiviruses, e.g., Feline immunodeficiency virus 
(FIV); Bovine lentiviruses, e.g., Bovine immunodeficiency virus (BIV); Ovine 
lentiviruses, e.g., Maedi/Visna virus (MW) and Caprine arthritis encephalitis virus 
5 (CAEV); and Equine lentiviruses, e.g., Equine infectious anemia virus (EIAV). All 
lentiviruses express at least two additional regulatory proteins (Tat, Rev) in addition 
to Gag, Pol, and Env proteins. Primate lentiviruses produce other accessory proteins 
including Nef, Vpr, Vpu, Vpx, and Vif. Generally, lentiviruses are the causative 
agents of a variety of disease, including, in addition to immunodeficiency, 
10 neurological degeneration, and arthritis. Nucleotide sequences of the various 
lentiviruses can be found in Genbank under the following Accession Nos. (from J. 
M. Coffin, S. H. Hughes, and H. E. Varmus, "Retroviruses" Cold Spring Harbor 
Laboratoiy Press, 199,7 p 804): 1) HIV-1: K03455, M19921, K02013, M3843 1, 
M38429, K02007 and M17449; 2) HIV-2: M30502, J04542, M30895, J04498, 
15 M15390, M31113 and L07625; 3) SIV:M29975, M30931, M58410, M66437, 
L06042, M33262, M19499, M32741, M31345 and L03295; 4) FIV: M25381, 
M36968 and Ul 1820; 5)BIV. M32690; 6)E1AV: M16575, M87581 and U01866; 
6)Visna: M10608, M51543, L06906, M6060? and M60610; 7) CAEV: M33677; ' 
and 8) Ovine lentivirus M31646 and M34193. Lentiviral DNA can also be obtained 
20 from the American Type Culture Collection (ATCC). For example, feline 
immunodeficienqy virus is available under ATCC Designation No. VR-2333 and 
VR-3112. Equine infectious anemia virus A is available under ATCC Designation 
No. VR-778. Caprine arthritis-encephalitis virus is available under ATCC 
Designation No. VR-905. Visna virus is available under ATCC Designation No. 
25 VR-779. As used herein, the term "nucleic acid" refers to polynucleotides such 
as deoxyribonucleic acid (DNA), and, where appropriate, ribonucleic acid (RNA). 
The term should also be understood to include, as equivalents, analogs of either 
RNA or DNA made from nucleotide analogs, and, as applicable to the embodiment 
being described, single-stranded (such as sense or antisense) and double-stranded 
30 polynucleotides. 

The term "maturation" as used herein refers to the production, post- 
translational processing, assembly and/or release of proteins that form a viral 
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particle. Accrodingly, this includes the processing of viral proteins leading to the 
pinching off of nascent virion from the cell membrane. 

A "membrane associated protein" is meant to include proteins that are 
integral membrane proteins as well as proteins that are stably associated with a 
5 membrane. 

The term "p6" or p6gag" is used herein to refer to a protein comprising a 
viral L domain. Antibodies that bind to a p6 domain are referred to as "anti-p6 
antibodies". p6 also refers to proteins that comprise artificially engineered L 
domains including, for example, L domains comprising a series of L motifs. 

10 The term "Gag protein" or "Gag polypeptide" refers to a polypeptide having 

Gag activity and preferably comprising an L (or late) domain. Exemplary Gag 
proteins include a motif such as PXXP, PPXY, RXXPXXP, RPDPTAP, RPLPVAP, 
RPEPTAP, YEDL, PTAPPEY and/or RPEPTAPPEE. HIV p24 is an exemplary 
Gag polypeptide. Gag-Pol proteins, such as the HIV pi 60 Gag-Pol are also Gag 

15 proteins 

A "POSH nucleic acid" is a nucleic acid comprising a sequence as 
represented in any of SEQ ID Nos: 1, 3, 4, 6, 8, and 10 as well as any of the variants 
described herein. 

A "POSH polypeptide" or "POSH protein" is a polypeptide comprising a 
20 sequence as represented in any of SEQ ID Nos: 2, 5, 7, 9 andl 1 as well as any of the 
variations described herein. 

A "POSH-associated protein" or "POSH-AP" Tefers to a protein capable of 
interacting with and/or binding to a POSH polypeptide. Generally, the POSH-AP 
may interact directly or indirectly with the POSH polypeptide. Exemplary POSH- 
25 APs are provided throughout. 

The terms peptides, proteins and polypeptides are used interchangeably 

herein. 

The term "purified protein" refers to a preparation of a protein or proteins 
which are preferably isolated from, or otherwise substantially free of, other proteins 
30 normally associated with the protein(s) in a cell or cell lysate. The term 
"substantially free of other cellular proteins" (also referred to herein as "substantially 
free of other contaminating proteins") is defined as encompassing individual 
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preparations of each of the component proteins comprising less than 20% (by dry 
weight) contaminating protein, and preferably comprises less than 5% contaminating 
protein. Functional forms of each of the component proteins can be prepared as 
purified preparations by using a cloned gene as described in the attached examples. 
5 By "purified", it is meant, when referring to component protein preparations used to 
generate a reconstituted protein mixture, that the indicated molecule is present in the 
substantial absence of other biological macromolecules, such as other proteins 
(particularly other proteins which may substantially mask, diminish, confuse or alter 
the characteristics of the component proteins either as purified preparations or in 
10 their function in the subject reconstituted mixture). The. term "purified" as used 
herein preferably means at least 80% by dry weight, more preferably in the range of 
85% by weight, more preferably 95-99% by weight, and most preferably at least 
99.8% by weight, of biological macromolecules of the same type present (but water, 
buffers, and other small molecules, especially molecules having a molecular weight 
1 5 of less than 5000, can be present). The term "pure" as used herein preferably has the 
.. . same numerical limits as "purified" immediately above. 

A "receptor" or "protein having a receptor function" is a protein that 
interacts with an extracellular ligand or a ligand that is within the cell but in a space 
that is topologically equivalent to the extracellular space (eg. inside the Golgi, inside 
20 the endoplasmic reticulum, inside the nuclear membrane, inside a lysosome or 
transport vesicle, etc.). Exemplary receptors are identified herein by annotation as 
such in various public databases. Receptors often have membrane domains. 

A **recombinant nucleic acid" is any nucleic acid that has been placed 
adjacent to another nucleic acid by recombinant DNA techniques. A "recombined 
25 nucleic acid" also includes any nucleic acid that has been placed next to a second 
nucleic acid by a laboratory genetic technique such as, for example, tranformation 
and integration, transposon hopping or viral insertion. In general, a recombined 
nucleic acid is not naturally located adjacent to the second nucleic acid. 

The term "recombinant protein" refers to a protein of the present application 
30 which is produced by recombinant DNA techniques, wherein generally DNA 
encoding the expressed protein is inserted into a suitable expression vector which is 
in turn used to transform a host cell to produce the heterologous protein. Moreover, 
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the phrase "derived from", with respect to a recombinant gene encoding the 

recombinant protein is meant to include within the meaning of "recombinant 

protein" those proteins having an amino acid sequence of a native protein, or an 

amino acid sequence similar thereto which is generated by mutations including 

5 substitutions and deletions of a naturally occurring protein. 

A "RING domain" or "Ring Finger" is a zinc-binding domain with a defined 

octet of cysteine and histidine residues. Certain RING domains comprise the 

consensus sequences as set forth below (amino acid nomenclature is as set forth in 

Table 1): Cys Xaa Xaa Cys Xaa i0 - 20 Cys Xaa His Xaa 2 . 5 Cys Xaa Xaa Cys Xaaj^so 

10 Cys Xaa Xaa Cys or Cys Xaa Xaa Cys Xaaio - 20 Cys Xaa His Xaa 2 - 5 His Xaa Xaa 

Cys Xaa^o Cys Xaa Xaa Cys. Certain RING domains are represented as amino 

acid sequences that are at least 80% identical to amino acids 12-52 of SEQ ID NO: 2 

and is set forth in SEQ ID No: 26. Preferred RING domains are 85%, 90%, 95%, 

98% and, most preferably, 100% identical to the amino acid sequence of SEQ ID 

15 NO: 26. Preferred RING domains of the application bind to various protein partners 

to form a complex that has ubiquitin ligase activity. RING domains preferably 

interact with at least one of the following protein types: F box proteins, E2 ubiquitin 

conjugating enzymes and cullins. 

The term "RNA interference" or "RNAi" refers to any method by which 

20 expression of a gene or gene product is decreased by introducing into a target cell 

one or more double-stranded RN As which are homologous to the gene of interest 

(particularly to the messenger RNA of the gene of interest). 

"Small molecule" as used herein, is meant to refer to a composition, which 

has a molecular weight of less than about 5 kD and most preferably less than about 

25 2.5 kD. Small molecules can be nucleic acids, peptides, polypeptides, 

peptidomimetics, carbohydrates, lipids or other organic (carbon containing) or 

» 

inorganic molecules. Many pharmaceutical companies have extensive libraries of 
chemical and/or biological mixtures comprising arrays of small molecules, often 
fungal, bacterial, or algal extracts, which can be screened with any of the assays of 
30 the application. 

An "SH3" or "Src Homology 3" domain is a protein domain of generally 
about 60 amino acid residues first identified as a conserved sequence in the non- 
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catalytic part of several cytoplasmic protein tyrosine kinases (e.g. Src, Abl, Lck). 
SH3 domains mediate assembly of specific protein complexes via binding to 
proline-rich peptides. Exemplary SH3 domains are represented by amino acids 137- 
192, 199-258, 448-505 and 832-888 of SEQ ID NO:2 and are set forth in SEQ ID 
5 Nos: 27-30. In certain embodiments, an SH3 domain interacts with a consensus 
sequence of RXaaXaaPXaaX6P (where X6, as defined in table 1 below, is a 
hydrophobic amino acid). In certain embodiments, an SH3 domain interacts with 
one or more of the following sequences: P(T/S)AP, PFRDY, RPEPTAP, 
RQGPKEP, RQGPKEPFR, RPEPTAPEE and RPLPVAP. 
10 As used herein, the term "specifically hybridizes" refers to the ability of a 

nucleic acid probe/primer of the application to hybridize to at least 12, 15, 20, 25, 
30, 35, 40, 45, 50 or 100 consecutive nucleotides of a POSH sequence, or a 
sequence complementary thereto, or naturally occurring mutants thereof, such that it 
has less than 15%, preferably less than 10%, and more preferably, less than 5% 
15 background hybridization to a cellular nucleic acid (e.g., mRNA or genomic DNA) 
•other than the POSH gene. A variety of hybridization conditions may be used to 
detect specific hybridization, and the stringency is determined primarily by the wash 
stage of the hybridization assay. Generally high temperatures and low salt 
concentrations give high stringency, while low temperatures and high salt 
20 concentrations give low stringency. Low stringency hybridization is achieved by 
washing in, for example, about 2.0 x SSC at 50 °C, and high stringency is acheived 
with about 0.2 x SSC at 50 °C. Further descriptions of stringency are provided 
below. 

As applied to polypeptides, "substantial sequence identity" means that two 
25 peptide sequences, when optimally aligned, such as by the programs GAP or 
BESTFIT using default gap which share at least 90 percent sequence identity, 
preferably at least 95 percent sequence identity, more preferably at least 99 percent 
sequence identity or more. Preferably, residue positions which are not identical 
differ by conservative amino acid substitutions. For example, the substitution of 
30 amino acids having similar chemical properties such as charge or polarity are not 
likely to effect the properties of a protein. Examples include glutamine for 
asparagine or glutamic acid for aspartic acid. 
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"Transcriptional regulatory sequence" is a generic term used throughout the 
specification to refer to DNA sequences, such as initiation signals, enhancers, and 
promoters, which induce or control transcription of protein coding sequences with 
which they are operably linked. In preferred embodiments, transcription of a 
5 recombinant protein gene is under the control of a promoter sequence (or other 
transcriptional regulatory sequence) which controls the expression of the 
recombinant gene in a cell-type in which expression is intended. It will also be 
understood that the recombinant gene can be under the control of transcriptional 
regulatory sequences which are the same or which are different from those 
1 0 sequences which control transcription of the naturally-occurring form of the protein. 

As used herein, a "transgenic animal" is any animal, preferably a non-human 
mammal, bird or an amphibian, in which one or more of the cells of the animal 
contain heterologous nucleic acid introduced by way of human intervention, such as 
by transgenic techniques well known in the art. The nucleic acid is introduced into 
15 - .the cell, directly or indirectly by introduction into a precursor of the cell, by way of 
deliberate genetic manipulation, such as by microinjection or by infection with a 
recombinant virus. The term genetic manipulation does not include classical cross- 
breeding, or in vitro fertilization, but rather is directed to the introduction of a 
recombinant DNA molecule. This molecule may be integrated within a 
20 chromosome, or it may be extrachromosomally replicating DNA. In the typical 
transgenic animals described herein, the transgene causes cells to express a 
recombinant human POSH protein. The "non-human animals" of the application 
include vertebrates such as rodents, non-human primates, sheep, dog, cow, chickens, 
amphibians, reptiles, etc. Preferred non-human animals are selected from the rodent 
25 family including rat and mouse, most preferably mouse, though transgenic 
amphibians, such as members of the Xenopus genus, and transgenic chickens can 
also provide important tools for understanding and identifying agents which can 
affect, for example, embryogenesis and tissue formation. The term "chimeric 
animal" is used herein to refer to animals in which the recombinant gene is found, or 
30 in which the recombinant is expressed in some but not all cells of the animal. The 
term "tissue specific chimeric animal" indicates that the recombinant human POSH 
genes is present and/or expressed in some tissues but not others. 
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As used herein, the term "transgene" means a nucleic acid sequence 
(encoding, e.g., human POSH polypeptides), which is partly or entirely 
heterologous, i.e., foreign, to the transgenic animal or cell into which it is 
introduced, or, is homologous to an endogenous gene of the transgenic animal or cell 

5 into which it is introduced, but which is designed to be inserted, or is inserted, into 
the animal's genome in such a way as to alter the genome of the cell into which it is 
inserted (e.g., it is inserted at a location which differs from that of the natural gene or 
its insertion results in a knockout). A transgene can include one or more 
transcriptional regulatory sequences and any other nucleic acid, such as introns, that 

1 0 may be necessary for optimal expression of a selected nucleic acid. 

As is well known, genes for a particular polypeptide may exist in single or 
multiple copies within the genome of an individual. Such duplicate genes may be 
identical or may have certain modifications, including nucleotide substitutions, 
additions or deletions, which all still code for polypeptides having substantially the 
15 same activity. 

A "virion" is a complete viral particle; nucleic acid and capsid (and a lipid 
envelope in some viruses. - 



Table 1: Abbreviations for classes of amino acids* 



Symbol 


Category 


Amino Acids 
Represented 


XI 


Alcohol 


Ser, Thr 


X2 


Aliphatic 


He, Leu, Val 


Xaa 


Any 


Ala, Cys, Asp, Glu, Phe, 
Gly, His, He, Lys, Leu, 
Met, Asn, Pro, Gin, Arg, 
Ser, Thr, Val, Tip, Tyr 
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X4 


Aromatic 


Phe, His, Trp, Tyr 


X5 


Charged 


Asp, Glu, His, Lys, Arg 


X6 


Hydrophobic 


Ala, Cys, Phe, Gly, His, 
lie, Lys, Leu, Met, Thr, 
Val, Trp, Tyr 


X7 


Negative 


Asp, Glu j 


X8 


Polar 


Cys, Asp, Glu, His, Lys, 
Asn, Gin, Arg, Ser, Thr 


X9 


Positive 


His, Lys, Arg 


X10 


Small 


Ala, Cys, Asp, Gly, Asn, 
Pro, Ser, Thr, Val 


Xll 


Tiny 


Ala, Gly, Ser 


X12 


Turnlike 


Ala, Cys, Asp, Glu, Gly, 
His, Lys, Asn, Gin, Arg, 
Ser, Thr 


X13 


Asparagine-Aspartate 


Asn, Asp 


* Abbreviations as adopted 


from http://smart.embl- 



heidelberg.de/SMART_DATA/alignments/consensus/grouping.html. 



2. Overview 

In certain aspects, the application relates to the discovery of novel 
associations between POSH proteins and other proteins (termed POSH-APs), and 
related methods and compositions. In certain aspects, the application relates to 
novel associations between certain disease states and POSH nucleic acids and 
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proteins. In certain aspects, the application relates to novel associations between 
certain disease states and POSH-AP nucleic acids and proteins. 

POSH intersects with and regulates a wide range of key cellular functions 
that may be manipulated by affecting the level of and/or activity of POSH 
5 polypeptides or POSH-AP polypeptides. In certain aspects, by identifying proteins 
associated with POSH, and particularly human POSH the present application 
provides methods for identifying diseases that are associated with defects in the 
POSH gene and methods for ameliorating such diseases. In further aspects, the 
application provides nucleic acid agents (e.g. RNAi probes, antisense), antibody- 
10 related agents, small molecules and other agents that affect POSH function. In 
further aspects, the application provides methods for identifying agents that affect 
POSH function, and the function of proteins that associate with POSH and/or 
participate in a POSH or POSH-AP mediated process. Other aspects and 
embodiments are described herein. 
15 In certain aspects, the application relates to the discovery that certain POSH 

polypeptides function as E3 enzymes in the ubiquitination system. Accordingly, 
downregulation or upregulation of POSH ubiquitin ligase activity can be used to 
manipulate biological processes that are affected by protein ubiquitination. 
Downregulation or upregulation may be achieved at any stage of POSH formation 
20 and regulation, including transcriptional, translational or post-translational 
regulation. For example, POSH transcript levels may be decreased by RNAi 
targeted at a POSH gene sequence. As another example, POSH ubiquitin ligase 
activity may be inhibited by contacting POSH with an antibody that binds to and 
interferes with a POSH RING domain or a domain of POSH that mediates 
25 interaction with a target protein (a protein that is ubiquitinated at least in part 
because of POSH activity). As another example, POSH activity may be increased 
by causing increased expression of POSH or an active portion thereof. A ubiquitin 
ligase, such as POSH, and POSH-APs that modulate the POSH ubiquitin ligase 
activity may participate in biological processes including, for example, one or more 
30 of the various stages of a viral lifecycle, such as viral entry into a cell, production of 
viral proteins, assembly of viral proteins and release of viral particles from the cell. 
POSH may participate in diseases characterized by the accumulation of 
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ubiquitinated proteins, such as dementias (e.g. Alzheimer's and Pick's), inclusion 
body myositis and myopathies, polyglucosan body myopathy, and certain forms of 
amyotrophic lateral sclerosis. POSH may participate in diseases characterized by 
the excessive or inappropriate ubiquitination and/or protein degradation. In 
5 addition, POSH may participate in oncological processes, such as the failure of cell 
division control systems, the failure of cell death regulatory systems, and the failure 
to downregulate hyperactive oncogenes, such as hyperactive membrane-bound 
growth factor receptors. By identifying certain POSH polypeptides as ubiquitin 
ligases, aspects of the present application permit one of ordinary skill in the art to 
10 identify diseases that are associated with an altered POSH ubiquitin ligase activity. 

In certain aspects, the application relates to the discovery that certain POSH 
polypeptides and POSH-APs are involved in viral maturation, including the 
production, post-translational processing, assembly and/or release of proteins in a 
viral particle. Accordingly, viral infections may be ameliorated by inhibiting an 
15 activity (e.g. ubiquitin ligase activity or target protein interaction) of POSH, and in 
preferred embodiments, the virus is a retroid virus, an RNA virus and an envelope 
virus, including HIV, Ebola, HBV, HCV and HTLV. Additional viral species are 
described in greater detail below. In certain instances, a decrease of a POSH 
function is lethal to cells infected with a virus that employs POSH in release of viral 
20 particles. While not wishing to be bound to mechanism, it appears that loss of 
POSH function in such cells leads to cell death through an overaccumulation of viral 
particles, or portions thereof, in a the cell. In certain embodiments, the inhibition of 
a POSH activity, e.g. by siRNA knockdown of POSH or certain POSH-APs may be 
used to destroy infected cells, even cells with nearly latent virus, because such cells 
25 will die from eventual overaccumulation of viral particles or portions thereof. 

In certain aspects, the application relates to the discovery that hPOSH 
interacts with Rac, a small GTPase. Rho, Rac and Cdc42 operate together to 
regulate organization of the actin cytoskeleton and the MLK-MKK-JNK MAP 
kinase pathway (Xu et al., 2003, EMBO J. 2: 252-61). Ectopic expression of mouse 
30 POSH ("mPOSH") activates the JNK pathway and causes nuclear localization of 
NF-kB. Overexpression of mPOSH in fibroblasts stimulates apoptosis. (Tapon et 

al. (1998) EMBO J. 17:1395-404). In Drosophila, POSH may interact, or otherwise 

<. 
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influence the signaling of, another GTPase, Ras. (Schnoir et al. (2001) Genetics 
159: 609-22). The JNK pathway and NF-kB regulate a variety of key genes 
involved in, for example, immune responses, inflammation, cell proliferation and 
apoptosis. Fot example, NF-kB regulates the production of interleukin 1, 
5 interleukin 8, tumor necrosis factor and many cell adhesion molecules. NF-kB has 
both pro-apoptotic and anti-apoptotic roles in the cell (e.g. in FAS-induced cell 
death and TNF-alpha signaling, respectively). NF-kB is negatively regulated, in 
part, by the inhibitor proteins IkBoc and IkBP (collectively termed "IkB"). 
Phosphorylation of IkB permits activation and nuclear localization of NF-kB. 
10 Phosphorylation of IkB triggers its degradation by the ubiquitin system. 
Accordingly, in yet another embodiment, a POSH polypeptide stimulates the JNK 
pathway. In an additional embodiment, a POSH polypeptide promotes nuclear 
localization of NF-kB. In further embodiments, manipulation of POSH levels 
and/or activities may be used to manipulate apoptosis. By upregulating POSH, 
15 apoptosis may be stimulated in certain cells, and this will generally be desirable in 
conditions characterized by excessive cell proliferation (e.g. in certain cancers). By 
downregulating POSH, apoptosis may be diminished in certain cells, and this will 
generally be desirable in conditions characterized by excessive cell death, such as 
myocardial infarction, stroke, degenerative diseases of muscle and nerve, and for 
20 organ preservation prior to transplant In a further embodiment, a POSH 
polypeptide associates with a vesicular trafficking complex, such as a clathrin- or 
coatomer- containing complex, and particularly a trafficking complex that localizes 
to the nucleus and/or Golgi apparatus. 

In certain aspects, the application relates to the discovery that a POSH 
25 polypeptide interacts with human UNC84B, a human homolog of C. elegans Unc- 
84, In C. elegans, Unc-84 is involved in the cellular positioning of the nucleus. 
UNC84/SUN is positioned at the nuclear membrane and recruits Syne/ANC-1, 
which directly tethers the nuclear envelope to the actin cytoskeleton. Accordingly, 
in certain aspects, POSH participates in formation of a UNC84 complexes, including 
30 human UNC84B-containing complexes, and in the connections between the nucleus 
and the cytoskeleton. In certain aspects, UNC84 polypeptides participate in POSH- 
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mediated processes. See, for example, Starr and Han, 2003, J Cell Sci 1 16(Pt 2):21 1- 
6. The term UNC84 is used herein to refer to various naturally occurring Unc-84 
homologs, as well as functionally similar variants and fragments that retain at least 
80%, 90%, 95%, or 99% sequence identity to a naturally occurring TJNC84. The 
term specifically includes human UNC84B nucleic acid and amino acid sequences 
and the sequences presented in the Examples. 

In certain aspects, the application relates to the discovery mat a POSH 
polypeptide interacts with MSTP028. Certain MSTP028 polypeptides contain one 
or more BTB/POZ domains mat are generally involved in dimerization. 
Accordingly the application provides complexes comprising POSH and MSTP028, 
optionally in a dimeric form. The term MSTP028 is used herein to refer to various 
naturally occurring MSTP028 homologs, as well as functionally similar variants and 
fragments that retain at least 80%, 90%, 95%, or 99% sequence identity to a 
naturally occurring MSTP028. The term specifically includes human MSTP028 
nucleic acid and amino acid sequences and the sequences presented in the Examples. 

In certain aspects, the application relates to the discovery that a POSH 
polypeptide interacts with HERPTJD1, a "homocysteine-inducible, endoplasmic 
reticulum stress-inducible, ubiquitin-like domain member 1" protein. (In previous 
applications, HERPUD1 was referred to as HEPUD1). Certain HERPUD1 
polypeptides are involved in JNK-mediated apoptosis, particularly in vascular 
endothelial cells, including cells that are exposed to high levels of homocysteine. 
Certain HERPUD1 polypeptides are involved in the Unfolded Protein Response, a 
cellular response to the presence of unfolded proteins in the endoplasmic reticulum. 
Certain HERPUD1 polypeptides are involved in the regulation of sterol 
biosynthesis. Accordingly, certain POSH polypeptides are involved in the Unfolded 
Protein Response and sterol biosynthesis. In other aspects, certain HERPUD1 
polypeptides enhance presenilin-mediated amyloid P-protein generation. For 
example, HERPUD1 polypeptides, when overexpressed in cells, increase the level of 
amyloid 0 generation, and it is observed that HERPUD1 polypeptides interact with 
the presenilin proteins, presenilin-1 and presenilin-2. (See Sai, X. et al (2002) J. 
Biol. Chem. 277:12915-12920). Accoidingly, in certain aspects, POSH 
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polypeptides may modulate the level of amyloid p generation. Additionally, POSH 
polypeptides may interact with presenilin 1 and presenilin 2. Therefore, it is 
believed certain POSH polypeptides modulate presenilin-mediated amyloid P 
generation. The accumulation of amyloid P is one hallmark of Alzheimer's disease. 
5 Accordingly, these POSH polypeptides may be involved in the pathogenesis of 
Alzheimer's disease. At sites such as late intracellular compartment sites including 
the trans-Golgi network, certain mutant presenilin-2 polypeptides up-regulate 
production of amyloid p peptides ending at position 42 (Ap42). (See Iwata, H. et al 
(2001) J. Biol. Chem. 276: 21678-21685). Accordingly, POSH polypeptides 

10 regulate production of Ap42 through mutant presenilin-2 at late intracellular 
compartment sites including the trans-Golgi network. Furthermore, elevated 
homocysteine levels have been found to be a risk factor associated with Alzheimer's 
disease and cerebral vascular disease. Some risk factors, such as elevated plasma 
homocysteine levels, may. accelerate or increase the severity of several central 

15. nervous system (CNS) disorders: Elevated levels of plasma homocysteine were 
found in young male patients with schizophrenia suggesting that elevated 
homocysteine levels could be related to the pathophysiology of aspects of 
schizophrenia (Levine, J. et al (2002) Am. J. Psychiatry 159:1790-2). Accordingly, 
certain POSH polypeptides may be involved in neurological disorders. Neurological 

20 disorders include disorders associated with increased levels of plasma homocysteine, 
increased levels of amyloid P production, or abberent presenilin acitivity. 
Neurological disorders include CNS disorders, such as Alzheimer's disease, cerebral 
vascular disease and schizophrenia. Certain POSH polypeptides may be involved in 
cardiovascular diseases, such as thromboembolic vascular disease, and particularly 

25 the disease characteristics associated with hyperhomocysteinemia. See, for 
example, Kokame et al. 2000 J. Biol. Chem. 275:32846-53; Zhang et al. 2001 
Biochem Biophys Res Commun 289:718-24. The term HERPUD1 is used herein to 
refer to various naturally occurring HERPUD1 homologs, as well as functionally 
similar variants and fragments that retain at least 80%, 90%, 95%, or 99% sequence 

30 identity to a naturally occurring HERPUDL The term specifically includes human 
HERPUD1 nucleic acid and amino acid sequences and the sequences presented in 
the Examples. 
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3. Exemplary Nucleic Acids and Expression Vectors 

In certain aspects the application provides nucleic acids encoding POSH 
polypeptides, such as, for example, SEQ ID Nos: 2, 5, 7, 9, 11, 26, 27, 28, 29 and 
30. Nucleic acids of the application are further understood to include nucleic acids 

5 that comprise variants of SEQ ID Nos:l, 3, 4, 6, 8, 10, 31, 32, 33, 34, and 35. 
Variant nucleotide sequences include sequences that differ by one or more 
nucleotide substitutions, additions or deletions, such as allelic variants; and will, 
therefore, include coding sequences that differ from the nucleotide sequence of the 
coding sequence designated in SEQ ID Nos:l, 3, 4, 6, 8 10, 31, 32, 33, 34, and 35, 

10 e.g., due to the degeneracy of the genetic code. In other embodiments, variants will 
also include sequences that will hybridize under highly stringent conditions to a 
nucleotide sequence of a coding sequence designated in any of SEQ ID Nosrl, 3, 4, 
6, 8 10, 31, 32, 33, 34, and 35. Preferred nucleic acids of the application are human 
POSH sequences, including, for example, any of SEQ ID Nos: 1, 3, 4, 6, 31, 32, 33, 

15 34, 35 and variants thereof and nucleic acids encoding an amino acid sequence 
selected from among SEQ ID Nos: 2, 5, 7, 26, 27, 28, 29 and 30. 

One of ordinary skill in the art will understand readily that appropriate 
stringency conditions which promote DNA hybridization can be varied. For 
example, one could perform the hybridization at 6.0 x sodium chloride/sodium 

20 citrate (SSC) at about 45 °C, followed by a wash of 2.0 x SSC at 50 °C. For 
example, the salt concentration in the wash step can be selected from a low 
stringency of about 2.0 x SSC at 50 °C to a high stringency of about 0.2 x SSC at 50 
°C. In addition, the temperature in the wash step can be increased from low 
stringency conditions at room temperature, about 22 °C, to high stringency 

25 conditions at about 65 °C. Both temperature and salt may be varied, or temperature 
or salt concentration may be held constant while the other variable is changed. In 
one embodiment, the application provides nucleic acids which hybridize under low 
stringency conditions of 6 x SSC at room temperature followed by a wash at 2 x 
SSC at room temperature. 

30 Isolated nucleic acids which differ from SEQ ID Nos: 1 , 3, 4, 6, 8, 1 0, 3 1 , 32, 

33, 34, and 35 due to degeneracy in the genetic code are also within the scope of the 
application. For example, a number of amino acids are designated by more than one 
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triplet. Codons that specify the same amino acid, or synonyms (for example, CAU 
and CAC are synonyms for histidine) may result in "silent" mutations which do not 
affect the amino acid sequence of the protein. However, it is expected that DNA 
sequence polymorphisms that do lead to changes in the amino acid sequences of the 

S subject proteins will exist among mammalian cells. One skilled in the art will 
appreciate that these variations in one or more nucleotides (up to about 3-5% of the 
nucleotides) of the nucleic acids encoding a particular protein may exist among 
individuals of a given species due to natural allelic variation. Any and all such 
nucleotide variations and resulting amino acid polymorphisms are within the scope 

10 of this application. 

Optionally, a POSH nucleic acid of the application will genetically 
complement a partial or complete POSH loss of function phenotype in a cell. For 
example, a POSH nucleic acid of the application may be expressed in a cell in which 
endogenous POSH has been reduced by RNAi, and the introduced POSH nucleic 

15 acid will mitigate a phenotype resulting from the RNAi. An exemplary POSH loss 
of function phenotype is a decrease in virus-like particle production in a cell 
transfected with a viral vector, optionally an HTV vector. In certain embodiments, a 
POSH nucleic acid, when expressed at an effective level in a cell, induces apoptosis. 
Another aspect of the application relates to POSH nucleic acids that are used 

20 for antisense, RNAi or ribozymes. As used herein, nucleic acid therapy refers to 
administration or in situ generation of a nucleic acid or a derivative thereof which 
specifically hybridizes (e.g. binds) under cellular conditions with the cellular mRNA 
and/or genomic DNA encoding one of the subject POSH polypeptides so as to 
inhibit production of that protein, e.g. by inhibiting transcription and/or translation. 

25 The binding may be by conventional base pair complementarity, or, for example, in 
the case of binding to DNA duplexes, through specific interactions in the major 
groove of the double helix. 

An nucleic acid therapy construct of the present application can be delivered, 
for example, as an expression plasmid which, when transcribed in the cell, produces 

30 RNA which is complementary to at least a unique portion of the cellular mRNA 
which encodes a POSH polypeptide. Alternatively, the the construct is an 
oligonucleotide which is generated ex vivo and which, when introduced into the cell 
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causes inhibition of expression by hybridizing with the mRNA and/or genomic 
sequences encoding a POSH polypeptide. Such oligonucleotide probes are 
optionally modified oligonucleotide which are resistant to endogenous nucleases, 
e.g. exonucleases and/or endonucleases, and is therefore stable in vivo. Exemplary 
nucleic acid molecules for use as antisense oligonucleotides are phosphoramidate, 
phosphothioate and methylphosphonate analogs of DNA (see also U.S. Patents 
5,176,996; 5,264,564; and 5,256,775). Additionally, general approaches to 
constructing oligomers useful in nucleic acid therapy have been reviewed, for 
example, by van der Krol et al., (1988) Biotechniques 6:958-976; and Stein et al., 
(1988) Cancer Res 48:2659-2668. 

Accordingly, the modified oligomers of the application are useful in 
therapeutic, diagnostic, and research contexts. In therapeutic applications, the 
oligomers are utilized in a manner appropriate for nucleic acid therapy in general. 

In addition to use in therapy, the oligomers of the application may be used as 
diagnostic reagents to detect the presence or absence of the POSH DNA or RNA 
sequences to which they specifically bind, such as for determining the level of 
expression of a gene of the application or for determining whether a gene of the 
application contains a genetic lesion. 

In another aspect of the application, the subject nucleic acid is provided in an 
expression vector comprising a nucleotide sequence encoding a subject POSH 
polypeptide and operably linked to at least one regulatory sequence. Regulatory 
sequences are art-recognized and are selected to direct expression of the POSH 
polypeptide. Accordingly, the term regulatory sequence includes promoters, 
enhancers and other expression control elements. Exemplary regulatory sequences 
are described in Goeddel; Gene Expression Technology: Methods in Enzymology, 
Academic Press, San Diego, CA (1990). For instance, any of a wide variety of 
expression control sequences that control the expression of a DNA sequence when 
operatively linked to it may be used in these vectors to express DNA sequences 
encoding a POSH polypeptide. Such useful expression control sequences, include, 
for example, the early and late promoters of SV40, tet promoter, adenovims or 
cytomegalovirus immediate early promoter, the lac system, the trp system, the TAC 
or TRC system, T7 promoter whose expression is directed by T7 RNA polymerase, 
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the major operator and promoter regions of phage lambda , the control regions for fd 
coat protein, the promoter for 3-phosphoglycerate kinase or other glycolytic 
enzymes, the promoters of acid phosphatase, e.g., Pho5, the promoters of the yeast 
a-mating factors, the polyhedron promoter of the baculovirus system and other 
5 sequences known to control the expression of genes of prokaryotic or eukaryotic 
cells or their viruses, and various combinations thereof. It should be understood that 
the design of the expression vector may depend on such factors as the choice of the 
host cell to be transformed and/or the type of protein desired to be expressed. 
Moreover, the vector's copy number, the ability to control that copy number and the 
10 expression of any other protein encoded by the vector, such as antibiotic markers, 
should also be considered. 

As will be apparent, the subject gene constructs can be used to cause 
expression of the subject POSH polypeptides in cells propagated in culture, e.g. to 
produce proteins or polypeptides, including fusion proteins or polypeptides, for 
15 purification. - 

This application also pertains to a host cell transfected with a recombinant 
gene including a coding sequence -for * one- or more of the subject POSH 
polypeptides. The host cell may be any prokaryotic or eukaryotic cell. For 
example, a polypeptide of the present application may be expressed in bacterial cells 
20 such as E. coli, insect cells (e.g., using a baculovirus expression system), yeast, or 
mammalian ceils. Other suitable host cells are known to those skilled in the art 

Accordingly, the present application further pertains to methods of 
producing the subject POSH polypeptides. For example, a host cell transfected with 
an expression vector encoding a POSH polypeptide can be cultured under 
25 appropriate conditions to allow expression of the polypeptide to occur. The 
polypeptide may be secreted and isolated from a mixture of cells and medium 
containing the polypeptide. Alternatively, the polypeptide may be retained 
cytoplasmically and the cells harvested, lysed and the protein isolated. A cell 
culture includes host cells, media and other byproducts. Suitable media for cell 
30 culture are well known in the art. The polypeptide can be isolated from cell culture 
medium, host cells, or both using techniques known in the art for purifying proteins, 
including ion-exchange chromatography, gel filtration chromatography, 



-38- 



ultrafiltration, electrophoresis, and immimoaffinity purification with antibodies 
specific for particular epitopes of the polypeptide. In a preferred embodiment, the 
POSH polypeptide is a fusion protein containing a domain which facilitates its 
purification, such as a POSH-GST fusion protein, POSH-intein fusion protein, 
5 POSH-cellulose binding domain fusion protein, POSH-polyhistidine fusion protein 
etc. 

A nucleotide sequence encoding a POSH polypeptide can be used to produce 
a recombinant form of the protein via microbial or eukaryotic cellular processes. 
Ligating the polynucleotide sequence into a gene construct, such as an expression 

10 vector, and transforming or transfecting into hosts, either eukaryotic (yeast, avian, 
insect or mammalian) or prokaryotic (bacterial) cells, are standard procedures. 

A recombinant POSH nucleic acid can be produced by ligating the cloned 
gene, or a portion thereof, into a vector suitable for expression in either prokaryotic 
cells, eukaryotic cells, or both. Expression vehicles for production of a recombinant 

15 POSH polypeptides include plasmids and other vectors. For instance, suitable 
vectors for the expression of a POSH polypeptide include plasmids of the types: 
pBR322-derived plasmids, pEMBL-derived plasmids, pEX-derived plasmids, 
pBTac-derived plasmids and pUC-derived plasmids for expression in prokaryotic 
cells, such as 2s. colL 

20 A number of vectors exist for the expression of recombinant proteins in 

yeast For instance, YEP24, YIPS, YEP51, YEP52, pYES2, and YRP17 are cloning 
and expression vehicles useful in the introduction of genetic constructs into S. 
cerevisiae (see, for example, Broach et al., (1983) in 
Experimental Manipulation of Gene Expression, ed. M. Inouye Academic Press, p. 

25 83, incorporated by reference herein). These vectors can replicate in E, coli due the 
presence of the pBR322 ori, and in S. cerevisiae due to the replication determinant 
of the yeast 2 micron plasmid. In addition, drug resistance markers such as 
ampicillin can be used. 

The preferred mammalian expression vectors contain both prokaryotic 

30 sequences to facilitate the propagation of the vector in bacteria, and one or more 
eukaryotic transcription units that are expressed in eukaryotic cells. The 
pcDNAI/amp, pcDNAI/neo, pRc/CMV, pSV2gpt, pSV2neo, pSV2-dhfr, pTk2, 
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pRSVneo, pMSG, pSVT7, pko-neo and pHyg derived vectors are examples of 
mammalian expression vectors suitable for transfection of eukaryotic cells. Some of 
these vectors are modified with sequences from bacteria] plasmids, such as pBR322, 
to facilitate replication and drug resistance selection in both prokaryotic and 
5 eukaryotic cells. Alternatively, derivatives of viruses such as the bovine papilloma 
virus (BPV-1), or Epstein-Barr virus (pHEBo, pREP-derived and p205) can be used 
for transient expression of proteins in eukaryotic cells. Examples of other viral 
(including retroviral) expression systems can be found below in the description of 
gene therapy delivery systems. The various methods employed in the preparation of 

10 the plasmids and transformation of host organisms are well known in the art. For 
other suitable expression systems for both prokaiyotic and eukaryotic cells, as well 
as general recombinant procedures, see Molecular Cloning A Laboratory Manual, 
2nd Ed., ed. by Sambrook, Fritsch and Maniatis (Cold Spring Harbor Laboratory 
Press, 1989) Chapters 16 and 17. In some instances, it may be desirable to express 

15 the recombinant POSH polypeptide by the use of a baculovirus expression system. 
Examples of such baculovirus expression systems include pVL-derived vectors 
(such as pVL1392, pVL1393 and pVL941),.,pAcUW-derived vectors, (such as 
pAcUWl), and pBlueBac-derived vectors (such as the B-gal containing pBlueBac 

m). 

20 It is well known in the art that a methionine at the N-terminal position can be 

enzymatically cleaved by the use of the enzyme methionine aminopeptidase (MAP). 
MAP has been cloned from E. coli (Ben-Bassat et aL, (1987) J. Bacteriol. 169:751- 
757) and Salmonella typhimurium and its in vitro activity has been demonstrated on 
recombinant proteins (Miller et aL, (1987) PNAS USA £4:2718-1722). Therefore, 

25 removal of an N-terminal methionine, if desired, can be achieved either in vivo by 
expressing such recombinant polypeptides in a host which produces MAP (e.g., E. 
coli or CM89 or & cerevisiae), or in vitro by use of purified MAP (e.g., procedure 
ofMilleretal.). 

Alternatively, the coding sequences for the polypeptide can be incorporated 
30 as a part of a fusion gene including a nucleotide sequence encoding a different 
polypeptide. This type of expression system can be useful under conditions where it 
is desirable, e.g., to produce an immunogenic fragment of a POSH polypeptide. For 
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example, the VP6 capsid protein of rotavirus can be used as an immunologic carrier 
protein for portions of polypeptide, either in the monomelic form or in the form of a 
viral particle. The nucleic acid sequences corresponding to the portion of the POSH 
polypeptide to which antibodies are to be raised can be incorporated into a fusion 
5 gene construct which includes coding sequences for a late vaccinia virus structural 
protein to produce a set of recombinant viruses expressing fusion proteins 
comprising a portion of the protein as part of the virion. The Hepatitis B surface 
antigen can also be utilized in thi$ role as well. Similarly, chimeric constructs 
coding for fusion proteins containing a portion of a POSH polypeptide and the 

10 poliovirus capsid protein can be created to enhance immunogenicity (see, for 
example, EP Publication NO: 0259149; and Evans et al.„ (1989) Nature 339:385; 
Huang et aL, (1988) Virol 62:3855; and Schlienger et aL, (1992) J. Virol 66:2). 

The Multiple Antigen Peptide system for peptide-based immunization can be 
utilized, wherein a desired portion of a POSH polypeptide is obtained directly from 

15 organo-chemical synthesis of the peptide onto an oligomeric branching lysine core 
(see, for example, Posnett et aL, (1988) JBC 263:1719,<and Nardelli et al., (1992) 
J. Immunol. 148:914). Antigenic determinants , of a POSH polypeptide can also be 
expressed and presented by bacterial cells. 

In another embodiment, a fusion gene coding for a purification leader 

20 sequence, such as a poly-(His)/enterokinase cleavage site sequence at the N- 
terminus of the desired portion of the recombinant protein, can allow purification of 
the expressed fusion protein by affinity chromatography using a Ni^ + metal resin. 
The purification leader sequence can then be subsequently removed by treatment 
with enterokinase to provide the purified POSH polypeptide (e.g., see Hochuli et al., 

25 (1 987) J. Chromatography 41 1 : 1 77; and Janknecht et aL, PNAS USA 88:8972). 

Techniques for making fusion genes are well known. Essentially, the joining 
of various DNA fragments coding for different polypeptide sequences is performed 
in accordance with conventional techniques, employing blunt-ended or stagger- 
ended termini for ligation, restriction enzyme digestion to provide for appropriate 

30 termini, filling-in of cohesive ends as appropriate, alkaline phosphatase treatment to 
avoid undesirable joining, and enzymatic ligation. In another embodiment, the 
fusion gene can be synthesized by conventional techniques including automated 
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DNA synthesizers. Alternatively, PCR amplification of gene fragments can be 
carried out using anchor primers which give rise to complementary overhangs 
between two consecutive gene fragments which can subsequently be annealed to 
generate a chimeric gene sequence (see, for example, Current Protocols in 
5 Molecular Biology, eds. Ausubel et aL, John Wiley & Sons: 1992). 



Table 2: Exemplary POSH nucleic acids 



Sequence Name 


Organism 


Accession Number 








cDNA FLJ11367 fis, clone 
HEMBA1000303 


Homo sapiens 


AK021429 


Plenty of SH3 domains 
(POSH) mRNA 


Mus musculus 


NM_021506 


Plenty of SH3s (POSH) 
mRNA 


Mus musculus 


AF030131 


Plenty of SH3s (POSH) 
mRNA 


Drosophila melanogaster 


NM_079052 


Plenty of SH3s (POSH) 
mRNA 


Drosophila melanogaster 


AF220364 


Table 3: Exemplary POSH polypeptides 


Sequence Name 


Organism 


Accession Number 








SH3 domains- 
containing protein POSH 


Mus musculus 


T09071 


plenty of SH3 domains 


Mus musculus 


NP_067481 


Plenty ofSH3s; POSH 


Mus musculus 


AAC40070 


Plenty of SH3s 


Drosophila melanogaster 


AAF372p5 
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LD45365p 


Drosophila melanogaster 


AAK93408 


POSH gene product 


Drosophila melanogaster 


AAF57833 


Plenty of SH3s 


Drosophila melanogaster 


NP.J23776 



In addition the following Tables provide the nucleic acid sequence and related SEQ 
ID NOs for domains of human POSH protein and a summary of sequence 
identification numbers used in this application. 
5 Table 4. Nucleic Acid Sequences and related SEQ ID NOs for domains in human 
POSH 



Name of the 
sequence 


Sequence 


SEQ ID 
NO. 


RING domain 


TGTCCGGTGTGTCTAGAGCGCCTTGATGCTTCTGCGAAGGTCT 
TGCCTTGCCAGCATACGTTTTGCAAGCGATGTTTGCT 

GGGGATCGTAGGTTCTCGAAATGAACTCAGATGTCCCGAGT 


31 


l 8t SH 3 
domain 


CCATGTGCCAAAGCGTTATACAACTATGAAGGAAAAGAGCCTG 
GAGACCTTAAATTCAGCAAAGGCGACATC^TCATTTT 

GCGAAGACAAGTGGATGAAAATTGGTACCATGGGGAAGTCAAT 
GGAATCCATGGCTTTTTCCCCACCAACTTTGTGCAGA 

TTATT 


32 


2 nd SH 3 
domain 


CCTCAGTGCAAAGCACTTTATGACTTTGAAGTGAAAGACAAGG 
AAGCAGACAAAGATTGCCTTCCATTTGCAAAGGATGA 

TGTTCTGACTGTGATCCGAAGAGTGGATGAAAACTGGGCTGAA 
GGAATGCTGGCAGACAAAATAGGAATATTTCCAATTT 

CATATGTTGAGTTTAAC 


33 


3** SH 3 
domain 


AGTGTGTATGTTGCTATATATCCATACACTCCTCGGAAAGAGG 
ATGAACTAGAGCTGAGAAAAGGGGAGATGTTTTTAGT 

GTTTGAGCGCTGCCAGGATGGCTGGTTCAAAGGGACATCCATG 
CATACCAGCAAGATAGGGGTTTTCCCTGGCAATTATG 


34 
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TGGCACCAGTC 




4 th SH 3 
domain 


GAAAGGCACAGGGTGGTGGTTTCCTATCCTCCTCAGAGTGAGG 
CAGAACTTGAACTTAAAGAAGGAGATATTGTGTTTGT 

TCATAAAAAACGAGAGGATGGCTGGTTCAAAGGCACATTACAA 
CGTAATGGGAAAACTGGCCTTTTCCCAGGAAGCTTTG 

TGGAAAACA 


35 



Table 5. Summary of Sequence Identification Numbers 



Sequence Information 


Sequence Identification Number 
(SEQ ID NO) 


Human POSH Coding Sequence . 


SEQ ID No: 1 


Human POSH Amino Acid Sequence 


SEQ ID No: 2 


Human POSH cDNA Sequence 


SEQ ID No: 3 


5' cDNA Fragment of Human POSH 


SEQ ID No: .4. 


N- terminus Protein Fragment of 
Human POSH 


SEQ ID. No: 5 


3' mRNA Fragment of Human POSH 


SEQ ID No: 6 


C- terminus Protein Fragment of 
Human POSH 


SEQ ID No: 7 


Mouse POSH mRNA Sequence 


SEQ ID No: 8 


Mouse POSH Protein Sequence 


SEQ ID No: 9 


Drosophila melanogaster POSH 
mRNA Sequence 


SEQ ID NO: 10 


Drosophila melanogaster POSH 
Protein Sequence 


SEQ ID No: 11 


Human POSH RING Domain Amino 
Acid Sequence 


SEQ ID No: 26 


Human POSH l ot SH 3 Domain Amino 
Acid Sequence 


SEQ ID No: 27 


Human POSH 2 nd SH 3 Domain Amino 
Acid Sequence 


SEQ ID NO: 28 


Human POSH 3 rt SH 3 Domain Amino 
Acid Sequence 


SEQ ID No: 29 


Human POSH 4 th SH 3 Domain Amino 


SEQ ID No: 30 
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Acid Sequence 




Human POSH RING Domain Nucleic 
Acid Sequence 


SEQ ID NO: 31 


Human POSH 1 st SH 3 Domain Nucleic 
Acid Sequence 


SEQ ID No: 32 


Human POSH 2 nd SH 3 Domain Nucleic 
Acid Sequence 


SEQ ID No: 33 


Human POSH 3 rd SH 3 Domain Nucleic 
Acid Sequence 


SEQ ID No: 34 


Human POSH 4 tn SH 3 Domain Nucleic 
Acid Sequence 


SEQ ID No: 35 



4. Exemplary Polypeptides 

The present application also makes available isolated and/or purified 
forms of the subject POSH polypeptides, which are isolated from, or otherwise 

5 substantially free of, other intracellular proteins which might normally be associated 
with the protein or a particular complex including the protein. In certain 
embodiments, POSH polypeptides have an amino acid sequence that is at least 60% 
identical to an amino acid sequence as set forth in any of SEQ ID Nos: 2, 5, 7, 9, 1 1, 
26, 27, 28, 29 and 30. In other embodiments, the polypeptide has an amino acid 

10 sequence at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 97%, 98%, 99% or 100% 
identical to an amino acid sequence as set forth in any of SEQ ID Nos: 2, 5, 7, 9, 1 1 , 
26, 27, 28, 29 and 30. 

Optionally, a POSH polypeptide of the application will function in place of 
an endogenous POSH polypeptide, for example by mitigating a partial or complete 

1 5 POSH loss of function phenotype in a cell. For example, a POSH polypeptide of the 
application may be produced in a cell in which endogenous POSH has been reduced 
by RNAi, and the introduced POSH polypeptide will mitigate a phenotype resulting 
from the RNAi. An exemplary POSH loss of function phenotype is a decrease in 
virus-like particle production in a cell transfected with a viral vector, optionally an 

20 HIV vector. In certain embodiments, a POSH polypeptide, when produced at an 
effective level in a cell, induces apoptosis. 

In certain embodiments, a POSH polypeptide of the application interacts 
with a viral Gag protein. In additional embodiments, POSH polypeptides may also, 
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or alternatively, fimction in ubiquitylation in part through the activity of a RING 
domain. In certain embodiments, POSH interacts with Gag polypeptides, and 
particularly Gag-Pol polypeptides, through ubiqutination mediated by the POSH 
RING domain. 

5 In another aspect, the application provides polypeptides that are agonists or 

antagonists of a POSH polypeptide. Variants and fragments of a POSH polypeptide 
may have a hyperactive or constitutive activity, or, alternatively, act to prevent 
POSH polypeptides from performing one or more functions. For example, a 
truncated form lacking one or more domain may have a dominant negative effect 

10 Another aspect of the application relates to polypeptides derived from a full- 

length POSH polypeptide. Isolated peptidyl portions of the subject proteins can be 
obtained by screening polypeptides recombinantly produced from the corresponding 
fragment of the nucleic acid encoding such polypeptides. In addition, fragments can 
be chemically synthesized using techniques known in the art such as conventional 

15 Merrifield solid phase f-Moc or t-Boc chemistry. For example, any one of the 
subject proteins can be arbitrarily divided into fragments of desired length with no 
overlap of the fragments, or preferably divided into overlapping fragments of a 
desired length. The fragments can be produced (recombinantly or by chemical 
synthesis) and tested to identify those peptidyl fragments which can function as 

20 either agonists or antagonists of the formation of a specific protein complex, or more 
generally of a POSH complex, such as by microinjection assays. 

It is also possible to modify the structure of the subject POSH polypeptides 
for such purposes as enhancing therapeutic or prophylactic efficacy, or stability 
(e.g., ex vivo shelf life and resistance to proteolytic degradation in vivo). Such 

25 modified polypeptides, when designed to retain at least one activity of the naturally- 
occurring form of the protein, are considered functional equivalents of the POSH 
polypeptides described in more detail herein. Such modified polypeptides can be 
produced, for instance, by amino acid substitution, deletion, or addition. 

For instance, it is reasonable to expect, for example, that an isolated 

30 replacement of a leucine with an isoleucine or valine, an aspartate with a glutamate, 
a threonine with a serine, or a similar replacement of an amino acid with a 
structurally related amino acid (i.e. conservative mutations) will not have a major 
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effect on the biological activity of the resulting molecule. Conservative replacements 
are those that take place within a family of amino acids that are related in their side 
chains. Genetically encoded amino acids are can be divided into four families: (1) 
acidic = aspartate, glutamate; (2) basic = lysine, arginine, histidine; (3) nonpolar = 

5 alanine, valine, leucine, isoleucine, proline, phenylalanine, methionine, tryptophan; 
and (4) uncharged polar = glycine, asparagine, glutamine, cysteine, serine, 
threonine, tyrosine. Phenylalanine, tryptophan, and tyrosine are sometimes classified 
jointly as aromatic amino acids. In similar fashion, the amino acid repertoire can be 
grouped as (1) acidic = aspartate, glutamate; (2) basic = lysine, arginine histidine, 

10 (3) aliphatic = glycine, alanine, valine, leucine, isoleucine, serine, threonine, with 
serine and threonine optionally be grouped separately as aliphatic-hydroxyl; (4) 
aromatic = phenylalanine, tyrosine, tryptophan; (5) amide = asparagine, glutamine; 
and (6) sulfur -containing = cysteine and methionine. (see, for example, 
Biochemistry, 2nd ed., Ed. by L. Stryer, W.H. Freeman and Co., 1981). Whether a 

15 change in the amino acid sequence of a polypeptide results in.a functional homolog 
can be readily determined by assessing the ability of the variant polypeptide to 
produce a response in cells in a. fashion similar to the wild-type protein. For 
instance, such variant forms of a POSH polypeptide can be assessed, e.g., for their 
ability to bind to another polypeptide, e.g., another POSH polypeptide or another 

20 protein involved in viral maturation. Polypeptides in which more than one 
replacement has taken place can readily be tested in the same manner. 

This application further contemplates a method of generating sets of 
combinatorial mutants of the subject POSH polypeptides, as well as truncation 
mutants, and is especially useful for identifying potential variant sequences (e.g. 

25 homologs) that are functional in binding to a POSH polypeptide. The purpose of 
screening such combinatorial libraries is to generate, for example, POSH homologs 
which can act as either agonists or antagonist, or alternatively, which possess novel 
activities all together. Combinatorially-derived homologs can be generated which 
have a selective potency relative to a naturally occurring POSH polypeptide. Such 

30 proteins, when expressed from recombinant DNA constructs, can be used in gene 
therapy protocols. 
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Likewise, mutagenesis can give rise to homologs which have intracellular 
half-lives dramatically different than the corresponding wild-type protein. For 
example, the altered protein can be rendered either more stable or less stable to 
proteolytic degradation or other cellular process which result in destruction of, or 
otherwise inactivation of the POSH polypeptide of interest. Such homologs, and the 
genes which encode them, can be utilized to alter POSH levels by modulating the 
half-life of the protein. For instance, a short half-life can give rise to more transient 
biological effects and, when part of an inducible expression system, can allow 
tighter control of recombinant POSH levels within the cell. As above, such proteins, 
and particularly their recombinant nucleic acid constructs, can be used in gene 
therapy protocols. 

In similar fashion, POSH homologs can be generated by die present 
combinatorial approach to act as antagonists, in that they are able to interfere with 
the ability of the corresponding wild-type protein to function. 

In a representative embodiment of this method, the amino acid sequences for 
a population of POSH homologs are aligned, preferably to promote the highest 
homology possible. Such a population of variants, xan include, for example, 
homologs from one or more species, or homologs from the same species but which 
differ due to mutation. Amino acids which appear at each position of the aligned 
sequences are selected to create a degenerate set of combinatorial sequences. In a 
preferred embodiment, the combinatorial library is produced by way of a degenerate 
library of genes encoding a library of polypeptides which each include at least a 
portion of potential POSH sequences. For instance, a mixture of synthetic 
oligonucleotides can be enzymatically ligated into gene sequences such that the 
degenerate set of potential POSH nucleotide sequences are expressible as individual 
polypeptides, or alternatively, as a set of larger fusion proteins (e.g. for phage 
display). 

There are many ways by which the library of potential homologs can be 
generated from a degenerate oligonucleotide sequence. Chemical synthesis of a 
degenerate gene sequence can be carried out in an automatic DNA synthesizer, and 
the synthetic genes then be ligated into an appropriate gene for expression. The 
purpose of a degenerate set of genes is to provide, in one mixture, all of the 
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sequences encoding the desired set of potential POSH sequences. The synthesis of 
degenerate oligonucleotides is well known in the art (see for example, Narang, SA 
(1983) Tetrahedron 39:3; Itakura et al., (1981) Recombinant DNA, Proc. 3rd 
Cleveland Sympos. Macromolecules, ed. AG Walton, Amsterdam: Elsevier pp273- 
289; Itakura et al., (1984) Annu. Rev. Biochem. 53:323; Itakura et al, (1984) 
Science 198:1056; Ike et al., (1983) Nucleic Acid Res. 11:477). Such techniques 
have been employed in the directed evolution of other proteins (see, for example, 
Scott et al., (1990) Science 249:386-390; Roberts et al., (1992) PNAS USA 
89:2429-2433; Devlin et al, (1990) Science 249: 404-406; Cwirla et al, (1990) 
PNAS USA 87: 6378-6382; as well as U.S. Patent Nos: 5,223,409, 5,198,346, and 
5,096,815). 

Alternatively, other forms of mutagenesis can be utilized to generate a 
combinatorial library. For example, POSH homologs (both agonist and antagonist 
forms) can be generated and isolated from a library by screening using, for example, 
15 alanine scanning mutagenesis and the like (Ruf et al, (1994) Biochemistry 33:1565- 
1572; Wang et al, (1994) J. Biol. Chem. 269:3095-3099; Balint et al, (1993) Gene 
137:109-118;. Grodberg et al, (1993) Eur. J, Biochem..21 8:597-601; Nagashima et 
al, (1993) J. Biol. Chem. 268:2888-2892; Lowman et al, (1991) Biochemistry 
30:10832-10838; and Cunningham et al, (1989) Science 244:1081-1085), by linker 
scanning mutagenesis (Gustin et al, (1993) Virology 193:653-660; Brown et al., 
(1992) Mol. Cell Biol. 12:2644-2652; McKnight et al., (1982) Science 232:316); by 
saturation mutagenesis (Meyers et al, (1986) Science 232:613); by PCR 
mutagenesis (Leung et al., (1989) Method Cell Mol Biol 1:11-19); or by random 
mutagenesis, including chemical mutagenesis, etc. (Miller et al, (1992) A Short 
25 Course in Bacterial Genetics, CSHL Press, Cold Spring Harbor, NY; and Greener et 
al, (1994) Strategies in Mol Biol 7:32-34). Linker scanning mutagenesis, 
particularly in a combinatorial setting, is an attractive method for identifying 
truncated (bioactive) forms of POSH polypeptides. 

A wide range of techniques are known in the art for screening gene products 
30 of combinatorial libraries made by point mutations and truncations, and, for that 
matter, for screening cDNA libraries for gene products having a certain property. 
Such techniques will be generally adaptable for rapid screening of the gene libraries 
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generated by the combinatorial mutagenesis of POSH homologs. The most widely 
used techniques for screening large gene libraries typically comprises cloning the 
gene library into replicable expression vectors, transforming appropriate cells with 
the resulting library of vectors, and expressing the combinatorial genes under 
5 conditions in which detection of a desired activity facilitates relatively easy isolation 
of the vector encoding the gene whose product was detected. Each of the illustrative 
assays described below are amenable to high through-put analysis as necessary to 
screen large numbers of degenerate sequences created by combinatorial mutagenesis 
techniques. 

10 In an illustrative embodiment of a screening assay, candidate combinatorial 

gene products of one of the subject proteins are displayed on the surface of a cell or 
virus, and the ability of particular cells or viral particles to bind a POSH polypeptide 
is detected in a "panning assay". For instance, a library of POSH variants can be 
cloned into the gene for a surface membrane protein of a bacterial cell (Ladner et 

. 15 al.„ WO 88/06630; Fuchs et al., (1991) Bio/Technology:9: 1370^1371; and Goward 
et al., (1992) TIBS 18:136-140), and the resulting fusion protein detected by 
panning, e.g. using a fluorescently. labeled molecule which binds the POSH 
polypeptide, to score for potentially functional homologs. Cells can be visually 
inspected and separated under a fluorescence microscope, or, where the morphology 

20 of the cell permits, separated by a fluorescence-activated cell sorter. 

In similar fashion, the gene library can be expressed as a fusion protein on 
the surface of a viral particle. For instance, in the filamentous phage system, foreign 
peptide sequences can be expressed on the surface of infectious phage, thereby 
conferring two significant benefits. First, since these phage can be applied to 

25 affinity matrices at very high concentrations, a large number of phage can be 
screened at one time. Second, since each infectious phage displays the 
combinatorial gene product on its surface, if a particular phage is recovered from an 
affinity matrix in low yield, the phage can be amplified by another round of 
infection. The group of almost identical E. coli filamentous phages Ml 3, fd, and fl 

30 are most often used in phage display libraries, as either of the phage gin or gVIH 
coat proteins can be used to generate fusion proteins without disrupting the ultimate 
packaging of the viral particle (Ladner et al., PCT publication WO 90/02909; 
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Garrard et al., PCT publication WO 92/09690; Marks et al., (1992) J. Biol. Chem. 
267:16007-16010; Griffiths et al., (1993) EMBO J. 12:725-734; Clackson et al., 
(1991) Nature 352:624-628; and Barbas et al., (1992) PNAS USA 89:4457-4461). 

The application also provides for reduction of the subject POSH 
polypeptides to generate mimetics, e.g. peptide or non-peptide agents, which are 
able to mimic binding of the authentic protein to another cellular partner. Such 
mutagenic techniques as described above, as well as the thioredoxin system, are also 
particularly useful for mapping the determinants of a POSH polypeptide which 
participate in protein-protein interactions involved in, for example, binding of 
proteins involved in viral maturation to each other. To illustrate, the critical residues 
of a POSH polypeptide which are involved in molecular recognition of a substrate 
protein can be determined and used to generate POSH polypeptide-derived 
peptidomimetics which bind to the substrate protein, and by inhibiting POSH 
binding, act to inhibit its biological activity. By employing, for example, scanning 
mutagenesis to map the amino acid residues? of a POSR polypeptide which are 
involved in binding to another polypeptide, . peptidomimetic compounds can be 
generated which mimic those residues involved in binding. For instance, non- 
hydrolyzable peptide analogs of such residues can be generated using 
benzodiazepine (e.g., see Freidinger et al., in Peptides: Chemistry and Biology, G.R. 
Marshall ed., ESCOM Publisher. Leiden, Netherlands, 1988), azepine (e.g., see 
Huffman et al., in Peptides: Chemistry and Biology, G.R. Marshall ed., ESCOM 
Publisher: Leiden, Netherlands, 1988), substituted gamma lactam rings (Garvey et 
al., in Peptides: Chemistry and Biology, G.R. Marshall ed., ESCOM Publisher: 
Leiden, Netherlands, 1988), keto-methylene pseudopeptides (Ewenson et al., (1986) 
J. Med. Chem. 29:295; and Ewenson et al., in Peptides: Structure and Function 
(Proceedings of the 9th American Peptide Symposium) Pierce Chemical Co. 
Rockland, IL, 1985), b-turn dipeptide cores (Nagai et al., (1985) Tetrahedron Lett 
26:647; and Sato et al., (1986) J Chem Soc Peikin Trans 1:1231), and b- 
aminoalcohols (Gordon et al., (1985) Biochem Biophys Res Commun 126:419; and 
Dann et al., (1986) Biochem Biophys Res Commun 134:71). 

The following table provides the sequences of the RING domain and the 
various SH3 domains. 
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Table 6. Amino Acid Sequences and related SEQ ED NOs for domains in human 
POSH 



Name of 

••Via 

cne 

sequence 


Sequence 


SEQ ID 
NO. 


RING 
domain 


CPVCLERLDASAKVLPCQHTFCKRCLLGIVGSRNELRCPEC 


26 


1 st SH 3 
domain 


PCAKALYNYEGKE PGDLKFSKGDI 1 1 LRRQVDENWYHGEVNGIHGF 
FPTNFVQIIK 


27 


2 nd SH 3 
domain 


PQCKALYDFEVKDKEADKDCLPFAKDDVLTVII^VDENWAEGMliAD 
KIGIFPISYVEFNS 


28 


3 rd SH 3 
domain 


SVYVAIYPYTPRKEDELEIiRKGEMFLVFERCQDGWFKGTSMHTSKI 
GVFPGNYVAPVT 


29 


4 th SH 3 
domain 


ERHRVWSYPPQSEAELELKEGDIVFVHKKREDGWFKGTLQRNGKT 

• ... 'v ' * . ' ■ 

GLFPGSFVENI 


30 



5 • 
5. * Antibodies and Uses Thereof 

Another aspect of the application pertains to an antibody specifically reactive 
with a POSH polypeptide. For example, by using immunogens derived from a 
POSH polypeptide, e.g., based on the cDNA sequences, anti-protein/anti-peptide 

10 antisera or monoclonal antibodies can be made by standard protocols (See, for 
example, Antibodies: A Laboratory Manual ed. by Harlow and Lane (Cold Spring 
Harbor Press: 1988)). A mammal, such as a mouse, a hamster or rabbit can be 
immunized with an immunogenic form of the peptide (e.g., a POSH polypeptide or 
an antigenic fragment which is capable of eliciting an antibody response, or a fusion 

15 protein as described above). Techniques for conferring immunogenicity on a protein 
or peptide include conjugation to carriers or other techniques well known in the art 
An immunogenic portion of a POSH polypeptide can be administered in the 
presence of adjuvant The progress of immunization can be monitored by detection 
of antibody titers in plasma or serum. Standard ELISA or other immunoassays can 

20 be used with the immunogen as antigen to assess the levels of antibodies. In a 
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preferred embodiment, the subject antibodies are immunospecific for antigenic 
determinants of a POSH polypeptide of a mammal, e.g., antigenic determinants of a 
protein set forth in SEQ ID NO:2. 

In one embodiment, antibodies are specific for a RING domain or an SH3 
5 domain, and preferably the domain is part of a POSH polypeptide. In a more 
specific embodiment, the domain is part of an amino acid sequence set forth in SEQ 
ID NO:2. In a set of exemplary embodiments, an antibody binds to one or more 
SH3 domains represented by amino acids 137-192 of SEQ ID NO:2, amino acids 
199-258 of SEQ ID NO:2> amino acids 448-505 of SEQ ID NO:2, and/or amino 

10 acids 832-888 of SEQ ID NO:2. In another exemplary embodiment, an antibody 
binds to a RING domain represented by amino acids 12-52 of SEQ ID NO:2. In 
another embodiment, the antibodies are immunoreactive with one or more proteins 
having an amino acid sequence that is at least 80% identical to an amino acid 
sequence as set forth in SEQ ID NO:2. In other embodiments, an antibody is 

15 immunoreactive with one or more proteins having an amino acid sequence that is 
85%, 90%, 95%, 98%, 99% or identical to an amino acid sequence as set forth in 
SEQIDNO:2. - 

Following immunization of an animal with an antigenic preparation of a 
POSH polypeptide, anti-POSH antisera can be obtained and, if desired, polyclonal 

20 anti-POSH antibodies isolated from the serum. To produce monoclonal antibodies, 
antibody-producing cells (lymphocytes) can be harvested from an immunized animal 
and fused by standard somatic cell fusion procedures with immortalizing cells such 
as myeloma cells to yield hybridoma cells. Such techniques are well known in the 
art, and include, for example, the hybridoma technique (originally developed by 

25 Kohler and Milstein, (1975) Nature, 256: 495-497), the human B cell hybridoma 
technique (Kozbar et al., (1983) Immunology Today, 4: 72), and the EBV- 
hybridoma technique to produce human monoclonal antibodies (Cole et al., (1985) 
Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, Inc. pp. 77-96). 
Hybridoma cells can be screened immunochemically for production of antibodies 

30 specifically reactive with a mammalian POSH polypeptide of the present application 
and monoclonal antibodies isolated from a culture comprising such hybridoma cells. 
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In one embodiment anti-human POSH antibodies specifically react with the protein 
encoded by a nucleic acid having SEQ ID NO:2. 

The term antibody as used herein is intended to include fragments thereof 
which are also specifically reactive with one of the subject POSH polypeptides. 
5 Antibodies can be fragmented using conventional techniques and the fragments 
screened for utility in the same manner as described above for whole antibodies. For 
example, F(ab>2 fragments can be generated by treating antibody with pepsin. The 
resulting F(ab)2 fragment can be treated to reduce disulfide bridges to produce Fab 
fragments. The antibody of the present application is further intended to include 

10 bispecific, single-chain, and chimeric and humanized molecules having affinity for a 
POSH polypeptide conferred by at least one CDR region of the antibody. In 
preferred embodiments, the antibodies, the antibody further comprises a label 
attached thereto and able to be detected, (e.g., the label can be a radioisotope, 
fluorescent compound, enzyme or enzyme co-factor). 

15 Anti-POSH antibodies can be Used, e.g., to monitor POSH polypeptide levels 

in an individual, particularly the presence of POSH at the plasma membrane for 
determining whether or not said patient is infected with a virus such as an RNA 
virus, a retroid virus, and an envelope virus, or allowing determination of the 
efficacy of a given treatment regimen for an individual afflicted with such a 

20 disorder. In addition, POSH polypeptides are expected to localize, occasionally, to 
the released viral particle. Viral particles may be collected and assayed for the 
presence of a POSH polypeptide. The level of POSH polypeptide may be measured 
in a variety of sample types such as, for example, cells and/or in bodily fluid, such as 
in blood samples. 

25 Another application of anti-POSH antibodies of the present application is in 

the immunological screening of cDNA libraries constructed in expression vectors 
such as gtll, gtl8-23, ZAP, and ORF8. Messenger libraries of this type, having 
coding sequences inserted in the correct reading frame and orientation, can produce 
fusion proteins. For instance, gtll will produce fusion proteins whose amino 

30 termini consist of B-galactosidase amino acid sequences and whose carboxy termini 
consist of a foreign polypeptide. Antigenic epitopes of a POSH polypeptide, e.g., 
other orthologs of a particular protein or other paralogs from the same species, can 
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then be detected with antibodies, as, for example, reacting nitrocellulose filters lifted 
from infected plates with the appropriate anti-POSH antibodies. Positive phage 
detected by this assay can then be isolated from the infected plate. Thus, the 
presence of POSH homologs can be detected and cloned from other animals, as can 
5 alternate isoforms (including splice variants) from humans. 

In certain embodiments, the application provides antibodies that disrupt the 
interaction between POSH and a POSH-AP. Such an antibody may bind to an 
epitope of the POSH-AP. Such an antibody may also bind to an epitope of POSH. 

10 6. Homology Searching of Nucleotide and Polypeptide Sequences 

The nucleotide or amino acid sequences of the application may be used as 
query sequences against databases such as GenBank, SwissProt, BLOCKS, and 
Pima II. These databases contain previously identified and annotated sequences that 
can be searched for regions of homology (similarity) using BLAST, which stands for 

15 Basic Local Alignment Search Tool (Altschul S F (1.993) J Mol Evol 36:290-300; 
Altschul, S F et al (1990) J Mol Biol 215:403-10). 

BLAST produces alignments of both nucleotide and amino acid sequences to 
determine sequence similarity. Because of the local nature of the alignments, 
BLAST is especially useful in determining exact matches or in identifying homologs 

20 which may be of prokaryotic (bacterial) or eukaryotic (animal, fungal or plant) 
origin. Other algorithms such as the one described in Smith, R. F. and T. F. Smith 
(1992; Protein Engineering 5:35-51), incorporated herein by reference, can be used 
when dealing with primary sequence patterns and secondary structure gap penalties. 
As disclosed in this application, sequences have lengths of at least 49 nucleotides 

25 and no more than 12% uncalled bases (where N is recorded rather than A, C, G, or 
T). 

The BLAST approach, as detailed in Karlin and Altschul (1993; Proc Nat 
Acad Sci 90:5873-7) and incorporated herein by reference, searches matches 
between a query sequence and a database sequence, to evaluate the statistical 
30 significance of any matches found, and to report only those matches which satisfy 
the user-selected threshold of significance. Preferably the threshold is set at 10-25 
for nucleotides and 3-15 for peptides. 
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7. Transgenic Animals and Uses Thereof 

Another aspect of the application features transgenic non-human animals 
which express a heterologous POSH and/or POSH-AP gene, preferentially a human 

5 POSH and/or POSH-AP gene of the present application, and/or which have had one 
or both copies of the endogenous POSH and/or POSH-AP genes disrupted in at least 
one of the tissue or cell-types of the animal. Accordingly, the application features 
an animal model for viral infection. In one embodiment, the transgenic non-human 
animals is a mammal such as a mouse, rat, rabbit, goat, sheep, dog, cat, cow, or non- 

10 human primate. Without being bound to theory, it is proposed that such an animal 
may be susceptible to infection with envelope viruses, retroid viruses and RNA 
viruses such as various rhabdoviruses, lentiviruses, and filoviruses. Accordingly, 
such a transgenic animal may serve as a useful animal model to study the 
progression of diseases caused by such viruses. Alternatively, such an animal can be 

15 useful as a basis to introduce one or more other human transgenes, to create a 
transgenic animal carrying multiple human genes involved in infection caused by 
retroid viruses, or RNA viruses, and envelope viruses. Retroid viruses include 
lentiviruses such as HIV. Other RNA viruses include filoviruses such as Ebola 
virus. As a result of the introduction of multiple human transgenes, the transgenic 

20 animal may become susceptible to certain viral infection and therefore provide an 
useful animal model to study these viral infection. 

In a preferred embodiment, the transgenic animal carrying human POSH 
gene is useful as a basis to introduce other human genes involved in HIV infection, 
such as Cyclin Tl, CD34, CCR5, and fusin (CRCX4). In a further embodiment, the 

25 additional human transgene is a gene involved in a disease or condition that is 
associated with AIDS (e.g. hypertension, Kaposi's sarcoma, cachexia, etc.) Such an 
animal may be an useful animal model for studying HIV infection, AIDS and related 
disease development 

Another aspect of the present application concerns transgenic animals which 
30 are comprised of cells (of that animal) which contain a transgene of the present 
application and which preferably (though optionally) express an exogenous POSH 
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and/or POSH-AP protein in one or more cells in the animal. A POSH or POSH-AP 
transgene can encode the wild-type form of the protein, or can encode homologs 
thereof, as well as antisense constructs- Moreover, it may be desirable to express the 
heterologous transgene conditionally such that either the timing or the level of gene 
5 expression can be regulated. Such conditional expression can be provided using 
prokaryotic promoter sequences which require prokaryotic proteins to be 
simultaneous expressed in order to facilitate expression of the transgene. Exemplary 
promoters and the corresponding trans-activating prokaryotic proteins are given in 
U.S. Pat No. 4,833,080. 

10 Moreover, transgenic animals exhibiting tissue specific expression can be 

generated, for example, by inserting a tissue specific regulatory element, such as an 
enhancer, into the transgene. For example, the endogenous POSH or POSH-AP gene 
promoter or a portion thereof can be replaced with another promoter and/or 
enhancer, e.g., a CMV or a Moloney murine leukemia virus (MLV) promoter and/or 

15 : enhancer. 

Alternatively, non-human transgenic animals that only express HIV 
transgenes in the brain can be generated using brain specific promoters (e.g. myelin 
basic protein (MBP) promoter, the neurofilament protein (NF-L) promoter, the 
gonadotropin-releasing hormone promoter, the vasopressin promoter and the 

20 neuron-specific enolase promoter, see So Forss-Petter et al., Neuron, 5, 187, (1990). 
Such animals can provide a useful in vivo model to evaluate the ability of a potential 
anti-HIV drug to cross the blood-brain barrier. Other target cells for which specific 
promoters can be used are, for example, macrophages, T cells and B cells. Other 
tissue specific promoters are well-known in the art, see e.g. RJaenisch, Science, 

25 240, 1468 (1988). 

Non-human transgenic animals containing an inducible transgene can be 
generated using inducible regulatory elements (e.g. metallothionein promoter), 
which are well-known in the art. Transgene expression can then be initiated in these 
animals by administering to the animal a compound which induces gene expression 
30 (e.g. heavy metals). Another preferred inducible system comprises a tetracycline- 
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inducible transcriptional activator (U.S. Pat No. 5,654,168 issued Aug. 5, 1997 to 
Bujard and Gossen and U.S. Pat. No. 5,650,298 issued Jul. 22, 1997 to Bujard et 
al.). 

In general, transgenic animal lines can be obtained by generating transgenic 
5 animals having incorporated into their genome at least one transgene, selecting at 
least one founder from these animals and breeding the founder or founders to 
establish at least one line of transgenic animals having the selected transgene 
incorporated into their genome. 

Animals for obtaining eggs or other nucleated cells (e.g. embryonic stem 
10 cells) for generating transgenic animals can be obtained from standard commercial 
sources such as Charles River Laboratories (Wilmington, Mass.), Taconic 
(Germantown, N.Y.), Harlan Sprague Dawley (Indianapolis, Ind.). 

Eggs can be obtained from suitable animals, e.g., by flushing from the 
oviduct or using techniques described in U.S. Pat. No. 5,489,742 issued Feb. 6, 1996 

15 to Hammer and Taurog; U.S. Pat. No. 5,625,125 issued on Apr. 29, 1997 to Bennett 
et al.; Gordon et al., 1980, Proc. Natl. Acad. Sci. USA 77:7380-7384; Gordon & 
Ruddle, 1981, Science 214: 1244-1246; U.S. Pat. No. 4,873,191 to T. E. Wagner 
and P. C. Hoppe; U.S. Pat No. 5,604,131; Armstrong, et al. (1988) J. of 
Reproduction, 39:51 1 or PCT application No. PCT/FR93/00598 (WO 94/00568) by 

20 Mehtali et al. Preferably, the female is subjected to hormonal conditions effective to 
promote superovulation prior to obtaining the eggs. 

Many techniques can be used to introduce DNA into an egg or other 
nucleated cell, including in vitro fertilization using sperm as a carrier of exogenous 
DNA ("sperm-mediated gene transfer", e.g., Lavitrano et al., 1989, Cell 57: 717- 
25 723), microinjection, gene targeting (Thompson et al., 1989, Cell 56: 313-321), 
electroporation (Lo, 1983, Mol. Cell. Biol. 3: 1803-1814), transfection, or retrovirus 
mediated gene transfer (Van der Putten et al., 1985, Proc. Natl. Acad. Sci. USA 82: 
6148-6152). For a review of such techniques, see Gordon (1989), Transgenic 
Animals, Intl. Rev. Cytol. 1 15:171-229. 
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Except for sperm-mediated gene transfer, eggs should be fertilized in 
conjunction with (before, during or after) other transgene transfer techniques. A 
preferred method for fertilizing eggs is by breeding the female with a fertile male. 
However, eggs can also be fertilized by in vitro fertilization techniques. 

5 Fertilized, transgene containing eggs can than be transferred to 

pseudopregnant animals, also termed "foster mother animals", using suitable 
techniques. Pseudopregnant animals can be obtained, for example, by placing 40-80 
day old female animals, which are more than 8 weeks of age, in cages with infertile 
males, e.g., vasectomized males. The next morning females are checked for vaginal 
10 plugs. Females who have mated with vasectomized males are held aside until the 
time of transfer. 

Recipient females can be synchronized, e.g. using GNRH agonist (GnRH-a): 
des-gJylO, (D-Ala6)-LH-RH Ethylamide, SigmaChemical Co.,St. Louis, Mo. 
Alternatively, a unilateral pregnancy can be achieved by a brief surgical procedure 

15 involving the "peeling" away of the bursa membrane on the left uterine horn. 
Injected embryos can then be transferred to the left uterine horn via the 
infundibulum. Potential transgenic founders can typically be identified immediately 
at birth from the endogenous litter mates. For generating transgenic animals from 
embryonic stem cells, see e.g. Tetatocarcinomas and embryonic stem cells, a 

20 practical approach, ed. E. J. Robertson, (IRL Press 1987) or in Potter et al Proc. 
Natl Acad, Sci. USA 81, 7161 (1984), the teachings of which are incorporated 
herein by reference. 

Founders that express the gene can then bred to establish a transgenic line. 
Accordingly, founder animals can be bred, inbred, crossbred or outbred to produce 

25 colonies of animals of the present application. Animals comprising multiple 
transgenes can be generated by crossing different founder animals (e.g. an HIV 
transgenic animal and a transgenic animal, which expresses human CD4), as well as 
by introducing multiple transgenes into an egg or embryonic cell as described above. 
Furthermore, embryos from A-transgenic animals can be stored as frozen embryos, 

30 which are thawed and implanted into pseudo-pregnant animals when needed (See 
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e.g. Hirabayashi et al. (1997) Exp Anim 46: 1 11 and Anzai (1994) Jikken Dobutsu 
43: 247). 

The present application provides for transgenic animals that carry the 
transgene in all their cells, as well as animals that cany the transgene in some, but 
5 not all cells, i.e., mosaic animals. The transgene can be integrated as a single 
transgene or in tandem, e.g., head to head tandems, or head to tail or tail to tail or as 
multiple copies. 

The successful expression of the transgene can be detected by any of several 
means well known to those skilled in the art. Non-limiting examples include 
10 Northern blot, in situ hybridization of mRNA analysis, Western blot analysis, 
immunohistochemistry, and FACS analysis of protein expression. 

In a further aspect, the application features non-human animal cells 
containing a POSH or POSH-AP transgene, preferentially a human POSH or POSH- 
AP transgene. For example, the animal cell (e.g. somatic cell or germ cell (i.e. egg or 
15 sperm)) can be obtained from the transgenic animal. Transgenic somatic cells or cell 
lines can be used, for example, in drug screening assays. Transgenic germ cells, on 
the other hand, can be used in generating transgenic progeny, as described above. 

The application further provides methods for identifying (screening) or for 
determining the safety and/or efficacy of virus therapeutics, i.e. compounds which 
20 are useful for treating and/or preventing the development of diseases or conditions, 
which are caused by, or contributed to by viral infection (e.g. AIDS). In addition the 
assays are useful for further improving known anti-viral compounds, e.g, by 
modifying their structure to increase their stability and/oT activity and/or toxicity. 

The transgenic animals can be used in in vivo assays to identify viral 
25 therapeutics. For example, the animals can be used in assays to identify compounds 
which reduce or inhibit any phase of the viral life cycle, e.g., expression of one or 
more viral genes, activity of one or more viral proteins, glycosylation of one or more 
viral proteins, processing of one or more viral proteins, viral replication, assembly of 
virions, and/or budding of infectious virions. 
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In an exemplary embodiment, the assay comprises administering a test 
compound to a transgenic animal of the application infected with a virus including 
RNA viruses, DNA viruses, retroidvirus and/or envelope viruses, and comparing a 
phenotypic change in the animal relative to a transgenic animal which has not 

5 received the test compound. For example, where the animal is infected with HIV, 
the phenotypic change can be the amelioration in an AIDS related complex (ARC), 
cataracts, inflammatory lesions in the central nervous system (CNV), a mild kidney 
sclerotic lesion, or a skin lesion, such as psoratic dermatitis, hyperkerstotic lesions, 
Kaposi's sarcoma or cachexia. The effect of a compound on inhibition of Kaposi's 

10 sarcoma can be determined, as described, e.g., in PCTAJS97/1 1202 (W097/49373) 
by Gallo et al. These and other HIV related symptoms or phenotypes are further 
described in Leonard et al. (1988) Science 242:1665. 

In another embodiment, the phenotypic change is release/budding of virus 
particles. In yet another embodiment, the phenotypic change is the number of CD4+ 
1 5 T cells or the ratio of CD4+ 1 cells versus CD8+ T cells. In HIV infected humans as 
well as in HIV transgenic mice, analysis of lymph nodes indicate that the number of 
CD4+ T cells decreases and the number of CD8+ T cells increases. Numbers of 
CD4+ and CD8+ T cells can be determined, for example, by indirect 
immunofluorescence and flow cytometry, as described, e.g., in Santoro et al., supra. 

20 Alternatively, a phenotypic change, e.g. a change in the expression level of 

an HIV gene can be monitored. The HIV RNA can be selected from the group 
consisting of gag mRNA, gag-pro-pol mRNA, vif mRNA, vpr mRNA, tat mKNA, 
rev mRNA, vpu/env mRNA, nef mRNA, and vpx mRNA. The HIV protein can be 
selected from the group consisting of Pr55 Gag and fragments thereof (pl7 MA, p24 

25 CA, p7 NC, pi , p9, p6, and p2), Prl 60 Gag-Pro-Pol, and fragments thereof (p 10 PR, 
p51 RT, p66 RT, p32 IN), p23 Vif, pi 5 Vpr, pl4 Tat, pl9 Rev, pl6 Vpu, gPr 160 
Env or fragments thereof (gpl20 SU and gp41TM), p27 Net and pl4 Vpx. The 
level of any of these mRNAs or proteins can be determined in cells from a tissue 
sample, such as a skin biopsy, as described in, e.g., PCT/US97/1 1202 (W097/49373) 

30 by Gallo et al. Quantitation of HIV mRNA and protein is further described 
elsewhere herein and also in, e.g., Dickie et al. (1996) AIDS Res. Human 
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Retrovirases 12:1103. In a preferred embodiment, the level of gpl20 on the surface 
of PBMC is determined. This can be done, as described in the examples, e.g., by 
immunofluorescence on PBMC obtained from the animals. 

A further phenotypic change is the production level or rate of viral particles 
5 in the serum and/or tissue of the animal. This can be determined, e.g., by 
determining reverse transcriptase (RT activity) or viral load as described elsewhere 
herein as well as in PCT/US97/11202 (W097/49373) by Gallo et aL, such as by 
determining p24 antigen. 

Yet another phenotypic change, which can indicate HIV infection or AIDS 
10 progression is the production of inflammatory cytokines such as 11^6, IL-8 and 
TNF-.alpha.; thus, efficacy of a compound as an anti-HIV therapeutic can be 
assessed by ELISA tests for the reduction of serum levels of any or all of these 
cytokines. 

A vaccine can be tested by administering a test antigen to a transgenic animal' 
15 of the application. The animal can optionally be boosted with the same or a different 
antigen. Such animal is then infected with a virus such as HIV. The production of 
viral particles or expression of viral proteins is then measured at various times 
following the administration of the test vaccine. A decrease in the amount of viral 
particles produced or viral expression will indicate that the test vaccine is efficient in 
20 reducing or inhibiting viral production and/or expression. The amount of antibody 
produced by the animal in response to the vaccine antigen can also be determined 
according to methods known in the art and provides a relative indication of the 
immunogenicity of the particular antigen. 

Cells from the transgenic animals of the application can be established in 
25 culture and immortalized to establish cell lines. For example, immortalized cell lines 
can be established from the livers of transgenic rats, as described in Bulera et al. 
(1997) Hepatology 25: 1 192. Cell lines from other types of cells can be established 
according to methods known in the art." 

In one cell-based assay, cells expressing a POSH or POSH-AP transgene can 
30 be infected with a virus of interest and incubated in the presence a test compound or 
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a control compound. The production of viral particles is then compared. This assay 
system thus provides a means of identifying molecular antagonists which, for 
example, function by interfering with viral release/budding. 

Cell based assays can also be used to identify compounds which modulate 
5 expression of a viral gene, modulate translation of a viral mRNA, ot which modulate 
the stability of a viral mRNA or protein. Accordingly, a cell which is infected with a 
virus of interest can be incubated with a test compound and the amount of the viral 
protein produced in the cell medium can be measured and compared to that 
produced from a cell which has not been contacted with the test compound. The 
10 specificity of the compound for regulating the expression of the particular virus gene 
can be confirmed by various control analyses, e.g., measuring the expression of one 
or more control genes. This type of cellular assay can be particularly useful for 
determining the efficacy of antisense molecules or ribozymes. 

15 8. RNA Interference. Ribozymes. Antisense and DNA Enzyme 

In certain aspects, the application relates to RNAi, ribosyme, antisense and 
other nucleic acid-related methods and compositions for manipulating (typically 
decreasing) a POSH activity. Exemplary RNAi and ribozyme molecules may 
comprise a sequence as shown in any of SEQ ID Nos: 15, 16, 18, 19, 21, 22, 24 and 

20 25. In certain aspects, the application relates to RNAi, ribozyme, antisense and 
other nucleic acid-related methods and compositions for manipulating (typically 
decreasing) a POSH-AP activity. Sequences for examples of UNC48, MSTP028 
and HERPUD1 nucleic acids that may be used to design nucleic acids for RNAi, 
ribozyme, antisense are listed in the Examples. 

25 Certain embodiments of the application make use of materials and methods 

for effecting knockdown of one or more POSH or POSH-AP genes by means of 
RNA interference (RNAi). RNAi is a process of sequence-specific post- 
transcriptional gene repression which can occur in eukaryotic cells. In general, this 
process involves degradation of an mRNA of a particular sequence induced by 

30 double-stranded RNA (dsRNA) that is homologous to that sequence. For example, 
the expression of a long dsRNA corresponding to the sequence of a particular single- 
stranded mRNA (ss mRNA) will labilize that message, thereby "interfering" with 
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expression of the corresponding gene. Accordingly, any selected gene may be 
repressed by introducing a dsRNA which corresponds to all or a substantial part of 
the mRNA for that gene. It appears that when a long dsRNA is expressed, it is 
initially processed by a ribonuclease III into shorter dsRNA oligonucleotides of as 
5 few as 21 to 22 base pairs in length. Furthermore, Accordingly, RNAi may be 
effected by introduction or expression of relatively short homologous dsRNAs. 
Indeed the use of relatively short homologous dsRNAs may have certain advantages 
as discussed below. 

Mammalian cells have at least two pathways that are affected by double- 

10 stranded RNA (dsRNA). In the RNAi (sequence-specific) pathway, the initiating 
dsRNA is first broken into short interfering (si) RNAs, as described above. The 
siRNAs have sense and antisense strands of about 21 nucleotides that form 
approximately 19 nucleotide si RNAs with overhangs of two nucleotides at each 3' 
end. Short interfering RNAs are thought to provide the sequence information that 

15 allows a specific messenger RNA to be targeted for degradation. In contrast, the 
nonspecific pathway is triggered by dsRNA of any sequence, as long as it is at least 
about 30 base pairs in- length. The nonspecific effects occur because. dsRNA 
activates two enzymes: PKR, which in its active form phosphorylates the translation 
initiation factor eIF2 to shut down all protein synthesis, and 2\ 5' oligoadenylate 

20 synthetase (2% 5'-AS), which synthesizes a molecule that activates Rnase L, a 
nonspecific enzyme that targets all mRNAs. The nonspecific pathway may 
represents a host response to stress or viral infection, and, in general, the effects of 
the nonspecific pathway are preferably minimized under preferred methods of the 
present application. Significantly, longer dsRNAs appear to be required to induce 

25 the nonspecific pathway and, accordingly, dsRNAs shorter than about 30 bases pairs 
are preferred to effect gene repression by RNAi (see Hunter et al. (1975) J Biol 
Chem 250: 409-17; Manche et al. (1992) Mol Cell Biol 12: 5239-48; Minks et al. 
(1 979) J Biol Chem 254: 101 80-3; and Elbashir et al. (2001) Nature 41 1 : 494-8), 

RNAi has been shown to be effective in reducing or eliminating the 

30 expression of genes in a number of different organisms including Caenorhabditiis 
elegans (see e.g. Fire et al. (1998) Nature 391: 806-11), mouse eggs and embryos 
(Wianny et al. (2000) Nature Cell Biol 2: 70-5; Svoboda et al. (2000) Development 
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127: 4147-56), and cultured RAT-1 fibroblasts (Bahramina et al. (1999) Mol Cell 
Biol 19: 274-83), and appears to be an anciently evolved pathway available in 
eukaiyotic plants and animals (Sharp (2001) Genes Dev. 15: 485-90). KNAi has 
proven to be an effective means of decreasing gene expression in a variety of cell 

5 types including HeLa cells, NIH/3T3 cells, COS cells, 293 cells and BHK-21 cells, 
and typically decreases expression of a gene to lower levels than that achieved using 
antisense techniques and, indeed, frequently eliminates expression entirely (see Bass 
(2001) Nature 411: 428-9). In mammalian cells, siRNAs are effective at 
concentrations that are several orders of magnitude below the concentrations 

1 0 typically used in antisense experiments (Elbashir et al . (200 1 ) Nature 411: 494-8). 

The double stranded oligonucleotides used to effect RNAi are preferably less 
than 30 base pairs in length and, more preferably, comprise about 25, 24, 23, 22, 21, 
20, 19, 18 or 17 base pairs of ribonucleic acid. Optionally the dsRNA 
oligonucleotides of the application may include 3* overhang ends. Exemplary 2- 

15 nucleotide 3' overhangs may be composed of ribonucleotide residues of any type 
and may even be composed of 2*Hleoxythymidine resides, which lowers the cost of 
RNA synthesis and may enhance nuclease resistance of siRNAs in the cell culture 
medium and within transfected cells (see Elbashir et al. (2001) Nature 411: 494-8). 
Longer dsRNAs of 50, 75, 100 or even 500 base pairs or more may also be utilized 

20 in certain embodiments of the application. Exemplary concentrations of dsRNAs for 
effecting RNAi are about 0.05 nM, 0.1 nM, 0.5 nM, 1.0 nM, 1.5 nM, 25 nM or 100 
nM, although other concentrations may be utilized depending upon the nature of the 
cells treated, the gene target and other factors readily discernable the skilled artisan. 
Exemplary dsRNAs may be synthesized chemically or produced in vitro or in vivo 

25 using appropriate expression vectors. Exemplary synthetic RNAs include 21 
nucleotide RNAs chemically synthesized using methods known in the art (e.g. 
Expedite RNA phophoramidites and thymidine phosphoramidite (Proligo, 
Germany). Synthetic oligonucleotides are preferably deprotected and gel-purified 
using methods known in the art (see e.g. Elbashir et al. (2001) Genes Dev. 15: 188- 

30 200). Longer RNAs may be transcribed from promoters, such as T7 RNA 
polymerase promoters, known in the art. A single RNA target, placed in both 
possible orientations downstream of an in vitro promoter, will transcribe both 
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strands of the target to create a dsRNA oligonucleotide of the desired target 
sequence. Any of the above RNA species will be designed to include a portion of 
nucleic acid sequence represented in a POSH or POSH-AP nucleic acid, such as, for 
example, a nucleic acid that hybridizes, under stringent and/or physiological 
5 conditions, to any of SEQ ID Nos: 1, 3, 4, 6, 8 and 10 and complements thereof or 
any of the POSH-AP sequences presented in the Examples. 

The specific sequence utilized in design of the oligonucleotides may be any 
contiguous sequence of nucleotides contained within the expressed gene message of 
the target. Programs and algorithms, known in the art, may be used to select 

10 appropriate target sequences. In addition, optimal sequences may be selected 
utilizing programs designed to predict the secondary structure of a specified single 
stranded nucleic acid sequence and allowing selection of those sequences likely to 
occur in exposed single stranded regions of a folded mRNA. Methods and 
compositions for designing appropriate oligonucleotides may be found, for example, 
. 15 in U.S. Patent Nos. 6,251,588, the contents of which are incorporated herein by 
reference. Messenger RNA (mRNA) is generally thought of as a linear molecule 
which contains the information for directing protein synthesis within.the sequence of 
ribonucleotides, however studies have revealed a number of secondary and tertiary 
structures that exist in most mRNAs. Secondary structure elements in RNA are 

20 formed largely by Watson-Crick type interactions between different regions of the 
same RNA molecule. Important secondary structural elements include 
intramolecular double stranded regions, hairpin loops, bulges in duplex RNA and 
internal loops. Tertiary structural elements are formed when secondary structural 
elements come in contact with each other or with single stranded regions to produce 

25 a more complex three dimensional structure. A number of researchers have 
measured the binding energies of a large number of RNA duplex structures and have 
derived a set of rales which can be used to predict the secondary structure of RNA 
(see e.g. Jaeger et al. (1989) Proc. Natl. Acad. Sci. USA 86:7706 (1989); and Turner 
et al. (1988) Annu. Rev. Biophys. Biophys. Chem. 17:167) . The rules are useful in 

30 identification of RNA structural elements and, in particular, for identifying single 
stranded RNA regions which may represent preferred segments of the mRNA to 
target for silencing RNAi, ribozyme or antisense technologies. Accordingly, 
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preferred segments of the mRNA target can be identified for design of the RNAi 
mediating dsRNA oligonucleotides as well as for design of appropriate ribozyme 
and hammerheadribozyme compositions of the application. 

The dsRNA oligonucleotides may be introduced into the cell by transfection 
5 with an heterologous target gene using carrier compositions such as liposomes, 
which are known in the art- e.g. Lipofectamine 2000 (Life Technologies) as 
described by the manufacturer for adherent cell lines. Transfection of dsRNA 
oligonucleotides for targeting endogenous genes may be carried out using 
Oligofectamine (Life Technologies). Transfection efficiency may be checked using 

10 fluorescence microscopy for mammalian cell lines after co-transfection of hGFP- 
encoding pAD3 (Kehlenback et al. (1998) J Cell Biol 141: 863-74). The 
effectiveness of the RNAi may be assessed by any of a number of assays following 
introduction of the dsRNAs. These include Western blot analysis using antibodies 
which recognize the POSH or POSH-AP gene product following sufficient time for 

15 . turnover of the endogenous pool after new protein synthesis is repressed, reverse . 
transcriptase polymerase chain reaction and Northern blot analysis to determine the 
level of existing POSH or POSH-AP target mRNA. . 

Further compositions, methods and applications of RNAi technology are 
provided in U.S. Patent Application Nos. 6,278,039, 5,723,750 and 5,244,805, 

20 which are incorporated herein by reference. 

Ribozyme molecules designed to catalytically cleave POSH or POSH-AP 
mRNA transcripts can also be used to prevent translation of suject POSH or POSH- 
AP mRNAs and/or expression of POSH or POSH-APs (see, e.g., PCT International 
Publication WO90/11364, published October 4, 1990; Sarver et al. (1990) Science 

25 247:1222-1225 and U.S. Patent No. 5,093,246). Ribozymes are enzymatic RNA 
molecules capable of catalyzing the specific cleavage of RNA. (For a review, see 
Rossi (1994) Current Biology 4: 469-471). The mechanism of ribozyme action 
involves sequence specific hybridization of the ribozyme molecule to 
complementary target RNA, followed by an endonucleolytic cleavage event. The 

30 composition of ribozyme molecules preferably includes one or more sequences 
complementaiy to a POSH or POSH-AP mRNA, and the well known catalytic 
sequence responsible for mRNA cleavage or a functionally equivalent sequence 
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(see, e.g., U.S. Pat. No. 5,093,246, which is incorporated herein by reference in its 
entirety). 

While ribozymes that cleave mRNA at site specific recognition sequences 
can be used to destroy target mRNAs, the use of hammerhead ribozymes is 
preferred. Hammerhead ribozymes cleave mRNAs at locations dictated by flanking 
regions that form complementary base pairs with the target mRNA. Preferably, the 
target mRNA has the following sequence of two bases: 5*-UG-3\ The construction 
and production of hammerhead ribozymes is well known in the art and is described 
more fully in Haseloflf and Gerlach ((1988) Nature 334:585-591; and see PCT 
Appln. No. WO89/05852, the contents of which are incorporated herein by 
reference). Hammerhead ribozyrne sequences can be embedded in a stable RNA 
such as a transfer RNA (tRNA) to increase cleavage efficiency in vivo (Perriman et 
al. (1995) Proc. Natl. Acad. Sci. USA, 92: 6175-79; de Feyter, and Gaudron, 
Methods in Molecular Biology, Vol. 74, Chapter 43, "Expressing Ribozymes in 
Plants", Edited by Turner, P. C, Humana Press Inc., Totowa, N.J). In particular, 
.RNA polymerase Ill-mediated expression of tRNA fusion ribozymes are well 
known in the art ( see Kawasaki et al. (1998) Nature 393:. 284-9; Kuwabaia et al. 
(1998) Nature BiotechnoL 16: 961-5; and Kuwabara et at (1998) MoL Cell 2: 617- 
27; Koseki et al. (1999) J Virol 73: 1868-77; Kuwabara et al. (1999) Proc Natl Acad 
Sci USA 96: 1886-91; Tanabe et al. (2000) Nature 406: 473-4). There are typically 
a number of potential hammerhead ribozyrne cleavage sites within a given target 
cDNA sequence. Preferably the ribozyrne is engineered so that the cleavage 
recognition site is located near the 5' end of the target mRNA- to increase 
efficiency and minimize the intracellular accumulation of non-functional mRNA 
transcripts. Furthermore, the use of any cleavage recognition site located in the 
target sequence encoding different portions of the C-terminal amino acid domains 
of, for example, long and short forms of target would allow the selective targeting of 
one or the other form of the target, and thus, have a selective effect on one form of 
the target gene product. 

Gene targeting ribozymes necessarily contain a hybridizing region 
complementary to two regions, each of at least 5 and preferably each 6, 7, 8, 9, 10, 
1 1, 12, 13, 14, 15, 16, 17, 18, 19 or 20 contiguous nucleotides in length of a POSH 
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or POSH-AP mRNA, such as an mRNA of a sequence represented in any of SEQ ID 
Nos: 1, 3, 4, 6, 8 or 10 or a POSH-AP presented in the Examples. In addition, 
ribozymes possess highly specific endoribonuclease activity, which autocatalytically 
cleaves the target sense mRNA. The present application extends to ribozymes 
5 which hybridize to a sense mRNA encoding a POSH gene such as a therapeutic drug 
target candidate gene, thereby hybridising to the sense mRNA and cleaving it, such 
that it is no longer capable of being translated to synthesize a functional polypeptide 
product 

The ribozymes of the present application also include RNA 

10 endoribonucleases (hereinafter "Cech-type ribozymes") such as the one which 
occurs naturally in Tetrahymena themiophila (known as the IVS, or L-19 IVS RNA) 
and which has been extensively described by Thomas Cech and collaborators (Zaug, 
et al. (1984) Science 224:574-578; Zaug, et al. (1986) Science 231:470-475; Zaug, 
et al. (1986) Nature 324:429-433; published International patent application No, 

15 WO88/04300 by University Patents Inc.; Been, et al. (1986) Cell 47:207-216). The 
Cech-type ribozymes have an eight base pair active site which hybridizes to a target 
RNA sequence whereafter cleavage of the target RNA takes place. The application 
encompasses those Cech-type ribozymes which target eight base-pair active site 
sequences that are present in a target gene or nucleic acid sequence. 

20 Ribozymes can be composed of modified oligonucleotides (e.g., fpr 

improved stability, targeting, etc.) and should be delivered to cells which express the 
target gene in vivo. A preferred method of delivery involves using a DNA construct 
"encoding" the ribozyme under the control of a strong constitutive pol III or pol II 
promoter, so that transfected cells will produce sufficient quantities of the ribozyme 

25 to destroy endogenous target messages and inhibit translation. Because ribozymes, 
unlike antisense molecules, are catalytic, a lower intracellular concentration is 
required for efficiency. 

In certain embodiments, a ribozyme may be designed by first identifying a 
sequence portion sufficient to cause effective knockdown by RNAi. The same 

30 sequence portion may then be incorporated into a ribozyme. In this aspect of the 
application, the gene-targeting portions of the ribozyme or RNAi are substantially 
the same sequence of at least 5 and preferably 6, 7, 8, 9, 10, 1 1, 12, 13, 14, 15, 16, 
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17, 18, 19 or 20 or more contiguous nucleotides of a POSH nucleic acid, such as a 
nucleic acid of any of SEQ ID Nos: 1, 3, 4, 6, 8, or 10 or POSH-AP nucleic acid, as 
presented in the Examples. In a long target RNA chain, significant numbers of 
target sites are not accessible to the ribozyme because they are hidden within 
5 secondary or tertiary structures (Birikh et al. (1997) Eur J Biochem 245: 1-16). To 
overcome the problem of target RNA accessibility, computer generated predictions 
of secondary structure are typically used to identify targets that are most likely to be 
single-stranded or have an "open" configuration (see Jaeger et al. (1989) Methods 
Enzymol 183: 281-306). Other approaches utilize a systematic approach to 

10 predicting secondary structure which involves assessing a huge number of candidate 
hybridizing oligonucleotides molecules (seeMilner et al. (1997) Nat Biotechnol 15: 
537-41; and Patzel and Sczakiel (1998) Nat Biotechnol 16: 64-8). Additionally, U.S. 
Patent No. 6,251,588, the contents of which are hereby incorporated herein, 
describes methods for evaluating oligonucleotide probe sequences so as to predict 

1-5 • the potential for hybridization to a target nucleic acid sequence. The method of the 
application provides for the use of such methods to select preferred, segments of a 
target mRNA sequence that are predicted to be single-stranded and, further, for the 
opportunistic utilization of the same or substantially identical target mRNA 
sequence, preferably comprising about 10-20 consecutive nucleotides of the target 

20 mRNA, in the design of both the RNAi oligonucleotides and ribozymes of the 
application. 

A further aspect of the application relates to the use of the isolated 
"antisense" nucleic acids to inhibit expression, e.g., by inhibiting transcription 
and/or translation of a POSH or POSH-AP nucleic acid. The antisense nucleic acids 

25 may bind to the potential drug target by conventional base pair complementarity, or, 
for example, in the case of binding to DNA duplexes, through specific interactions 
in the major groove of the double helix. In general, these methods refer to the range 
of techniques generally employed in the art, and include any methods that rely on 
specific binding to oligonucleotide sequences. 

30 An antisense construct of the present application can be delivered, for 

example, as an expression plasmid which, when transcribed in the cell, produces 
RNA which is complementary to at least a unique portion of the cellular mRNA 
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which encodes a POSH or POSH-AP polypeptide. Alternatively, the antisense 
construct is an oligonucleotide probe, which is generated ex vivo and which, when 
introduced into the cell causes inhibition of expression by hybridizing with the 
mRNA and/or genomic sequences of a POSH or POSH-AP nucleic acid. Such 
5 oligonucleotide probes are preferably modified oligonucleotides, which are resistant 
to endogenous nucleases, e.g., exonucleases and/or endonucleases, and are therefore 
stable in vivo. Exemplary nucleic acid molecules for use as antisense 
oligonucleotides are phosphoramidate, phosphothioate and rriethylphosphonate 
analogs of DNA (see also U.S. Patents 5,176,996; 5,264,564; and 5,256,775). 

10 Additionally, general approaches to constructing oligomers useful in antisense 
therapy have been reviewed, for example, by Van der Krol et al. (1988) 
BioTechniques 6:958-976; and Stein et al. (1988) Cancer Res 48:2659- 2668. 

With respect to antisense DNA, oligodeoxyribonucleotides derived from the 
translation initiation site, e.g., between the -10 and +10 regions of the target gene, 
. tS are preferred. Antisense approaches involve the design of oligonucleotides (either 
. DNA or RNA) that are complementary to mRNA encoding a POSH or POSH-AP 
polypeptide. .The antisense oligonucleotides will bind to the mRNA transcripts and 
prevent translation. Absolute complementarity, although preferred, is not required. 
In the case of double-stranded antisense nucleic acids, a single strand of the duplex 

20 DNA may thus be tested, or triplex formation may be assayed. The ability to 
hybridize will depend on both the degree of complementarity and the length of the 
antisense nucleic acid. Generally, the longer the hybridizing nucleic acid, the more 
base mismatches with an RNA it may contain and still form a stable duplex (or 
triplex, as the case may be). One skilled in the art can ascertain a tolerable degree of 

25 mismatch by use of standard procedures to determine the melting point of the 
hybridized complex. 

Oligonucleotides that are complementary to the 5* end of the mRNA, e.g., the 
5' untranslated sequence up to and including the AUG initiation codon, should work 
most efficiently at inhibiting translation. However, sequences complementary to the 

30 V untranslated sequences of mRNAs have recently been shown to be effective at 
inhibiting translation of mRNAs as well. (Wagner, R. 1994. Nature 372:333). 
Therefore, oligonucleotides complementary to either the 5* or 3 1 untranslated, non- 
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coding regions of a gene could be used in an antisense approach to inhibit translation 
of that mRNA. Oligonucleotides complementary to the 5' untranslated region of the 
mKNA should include the complement of the AUG start codon. Antisense 
oligonucleotides complementary to mKNA coding regions are less efficient 

5 inhibitors of translation but could also be used in accordance with the application. 
Whether designed to hybridize to the 5', 3' or coding region of mRNA, antisense 
nucleic acids should be at least six nucleotides in length, and are preferably less that 
about 100 and more preferably less than about 50, 25, 17 or 10 nucleotides in length. 
It is preferred that in vitro studies are first performed to quantitate the ability 

10 of the antisense oligonucleotide to inhibit gene expression. It is preferred that these 
studies utilize controls that distinguish between antisense gene inhibition and 
nonspecific biological effects of oligonucleotides. It is also preferred that these 
studies compare levels of the target RNA or protein with that of an internal control 
KNA or protein. Results obtained using the antisense oligonucleotide may be 

15 compared with those obtained using a control oligonucleotide. It is preferred that 
the control oligonucleotide is of approximately the same length as the test 
oligonucleotide and that the nucleotide sequence of the oligonucleotide differs from 
the antisense sequence no more than is necessary to prevent specific hybridization to 
the target sequence. 

20 The antisense oligonucleotides can be DNA or RNA or chimeric mixtures or 

derivatives or modified versions thereof, single-stranded or double-stranded. The 
oligonucleotide can be modified at the base moiety, sugar moiety, or phosphate 
backbone, for example, to improve stability of the molecule, hybridization, etc. The 
oligonucleotide may include other appended groups such as peptides (e.g., for 

25 targeting host cell receptors), or agents facilitating transport across the cell 
membrane (see, e.g., Letsinger et al., 1989, Proc. Natl. Acad. Sci. U.S.A. 86:6553- 
6556; Lemaitre et al., 1987, Proc. Natl. Acad. Sci. 84:648-652; PCT Publication No. 
W088/09810, published December 15, 1988) or the blood- brain barrier (see, e.g., 
PCT Publication No. W089/10134, published April 25, 1988), hybridization- 

30 triggered cleavage agents. (See, e.g., Krol et al., 1988, BioTechniques 6:958- 976) 
or intercalating agents. (See, e.g., Zon, 1988, Pharm. Res. 5:539-549). To this end, 
the oligonucleotide may be conjugated to another molecule, e.g., a peptide, 
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hybridization triggered cross-linking agent, transport agent, hybridization-triggered 
cleavage agent, etc. 

The antisense oligonucleotide may comprise at least one modified base 
moiety which is selected from the group including but not limited to 5-fluorouracil, 

5 5- bromouracil, 5-chlorouracil, 5-iodouracil, hypoxanthine, xantine, 4- 
acetylcytosine, 5- (carboxyhydroxytiethyl) uracil, 5-carboxymethylaminomethyl-2- 
thiouridine, 5- carboxymethylaminomethyluracil, dihydrouracil, beta-D- . 
galactosylqueosine, inosine, N6- isopentenyladenine, 1-methylguanine, 1- 
methylinosine, 2,2-dimethylguanine, 2-methyladenine, 2-methylguanine, 3- 

10 methylcytosine, 5-methylcytosine, N6-adenine, 7-methylguanine, 5- 
methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, beta-D- 
mannosylqueosine, S'-methoxycarboxymethyluracil, 5-methoxyuracil, 2-methylthio- 
N6- isopentenyladenine, uracil-5-oxyacetic acid (v), wybutoxosine, pseudouracil, 
queosine, 2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 5- 

15 methyluracil, *uracil-5- oxyacetic acid methylester, uracil-5-oxyacetic acid (v), 5- 
methyl-2-thiouracil, 3-(3-amino-3- N-2-carboxypropyl) uracil, (acp3)w, and 2,6- 
diaminopurine. 

The antisense oligonucleotide may also comprise at least one modified sugar 
moiety selected from the group including but not limited to arabinose, 2- 

20 fluoroarabinose, xylulose, and hexose. 

The antisense oligonucleotide can also contain a neutral peptide-like 
backbone. Such molecules are termed peptide nucleic acid (PNA)-oligomers and 
are described, e.g., in Peny-O'Keefe et al. (1996) Proc. Natl. Acad. Sci. U.S.A. 
93:14670 and in Eglom et al. (1993) Nature 365:566. One advantage of PNA 

25 oligomers is their capability to bind to complementary DNA essentially 
independently from the ionic strength of the medium due to the neutral backbone of 
the DNA In yet another embodiment, the antisense oligonucleotide comprises at 
least one modified phosphate backbone selected from die group consisting of a 
phosphorothioate, a phosphorodithioate, a phosphoramidothioate, a 

30 phosphoramidate, a phosphordiamidate, a methylphosphonate, an alkyl 
phosphotriester, and a formacetal or analog thereof. 
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In yet a further embodiment, the antisense oligonucleotide is an alpha- 
anomeric oligonucleotide. An alpha-anomexic oligonucleotide forms specific 
double-stranded hybrids with complementary RNA in which, contrary to the usual 
antiparallel orientation, the strands run parallel to each other (Gautier et al., 1987, 

5 Nucl. Acids Res. 15:6625-6641). The oligonucleotide is a T-O-methylribonucleotide 
(Inoue et al., 1987, Nucl. Acids Res. 15:6131-6148), or a chimeric RNA-DNA 
analogue (Inoue et al., 1987, FEBS Lett. 215:327-330). 

While antisense nucleotides complementary to the coding region of a POSH 
or POSH-AP mRNA sequence can be used, those complementary to the transcribed 

1 0 untranslated region may also be used. 

In certain instances, it may be difficult to achieve intracellular concentrations 
of the antisense sufficient to suppress translation on endogenous mRNAs. Therefore 
a preferred approach utilizes a recombinant DNA construct in which the antisense 
oligonucleotide is placed under the control of a strong pol m or pol II promoter. 
• . 15 . The use of such a construct to transfect target cells will result in the transcription of 
. sufficient amounts of single stranded RNAs that will form complementary base pairs 
with the endogenous potential drug target transcripts and thereby prevent translation. 
For example, a vector can be introduced such that it is taken up by a cell and directs 
the transcription of an antisense RNA. Such a vector can remain episomal or 

20 become chromosomally integrated, as long as it can be transcribed to produce the 
desired antisense RNA. Such vectors can be constructed by recombinant DNA 
technology methods standard in the art Vectors can be plasmid, viral, or others 
known in the art, used for replication and expression in mammalian cells. 
Expression of the sequence encoding the antisense RNA can be by any promoter 

25 known in the art to act in mammalian, preferably human cells. Such promoters can 
be inducible or constitutive. Such prompters include but are not limited to: the 
SV40 early promoter region (Bemoist and Chambon, 1981, Nature 290:304-310), 
the promoter contained in the 3' long terminal repeat of Rous sarcoma virus 
(Yamamoto et al., 1980, Cell 22:787-797), the herpes thymidine kinase promoter 

30 (Wagner et al., 1981, Proc. Natl. Acad. Sci. U.S.A. 78:1441-1445), the regulatory 
sequences of the metallothionein gene (Brinster et al, 1982, Nature 296:39-42), etc. 
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Any type of plasmid, cosmid, YAC or viral vector can be used to prepare the 
recombinant DNA construct, which can be introduced directly into the tissue site. 

Alternatively, POSH or POSH-AP gene expression can be reduced by 
targeting deoxyribonucleotide sequences complementary to the regulatory region of 
5 the gene (i.e., the promoter and/or enhancers) to form triple helical structures that 
prevent transcription of the gene in target cells in the body. (See generally, Helene, 
C. 1991, Anticancer Drug Des., 6(6):569-84; Helene, C:, et al., 1992, Ann. N.Y. 
Acad. Sci., 660:27-36; and Maher, L.J., 1992, Bioassays 14(12):807-15). 

Nucleic acid molecules to be used in triple helix formation for the inhibition 

10 of transcription are preferably single stranded and composed of 
deoxyribonucleotides. The base composition of these oligonucleotides should 
promote triple helix formation via Hoogsteen base pairing rules, which generally 
require sizable stretches of either purines or pyrimidines to be present on one strand 
of a duplex. Nucleotide sequences may be pyrimidine-based, which will result in 

15 . TAT and CGC triplets across the three associated strands of the resulting triple 
helix. The pyrimidine-rich molecules provide base complementarity to a purine-rich 
.region of a single strand of the duplex in a parallel orientation to that strand. In 
addition, nucleic acid molecules may be chosen that are purine- rich, for example, 
containing a stretch of G residues. These molecules will form a triple helix with a 

20 DNA duplex that is rich in GC pairs, in which the majority of the purine residues are 
located on a single strand of the targeted duplex, resulting in CGC triplets across the 
three strands in the triplex. 

Alternatively, POSH or POSH-AP sequences that can be targeted for triple 
helix formation may be increased by creating a so called "switchback" nucleic acid 

25 molecule. Switchback molecules are synthesized in an alternating 5-3', 3 f -5* 
manner, such that they base pair with first one strand of a duplex and then the other, 
eliminating the necessity for a sizable stretch of either purines or pyrimidines to be 
present on one strand of a duplex. 

A further aspect of the application relates to the use of DNA enzymes to 

30 inhibit expression of a POSH or POSH-AP gene. DNA enzymes incorporate some 
of the mechanistic features of both antisense and ribozyme technologies. DNA 
enzymes are designed so that they recognize a particular target nucleic acid 
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sequence, much like an antisense oligonucleotide, however much like a ribozyme 
they are catalytic and specifically cleave the target nucleic acid. 

There are currently two basic types of DNA enzymes, and both of these were 
identified by Santoro and Joyce (see, for example, US Patent No. 6110462). The 
5 10-23 DNA enzyme comprises a loop structure which connect two arms. The two 
arms provide specificity by recognizing the particular target nucleic acid sequence 
while the loop structure provides catalytic function under physiological conditions. 

Briefly, to design an ideal DNA enzyme that specifically recognizes and 
cleaves a target nucleic acid, one of skill in the art must first identify the unique 
1 0 target sequence. This can be done using the same approach as outlined for antisense 
oligonucleotides. Preferably, the unique or substantially sequence is a G/C rich of 
approximately 18 to 22 nucleotides. High G/C content helps insure a stronger 
interaction between the DNA enzyme and the target sequence. 

When synthesizing the DNA enzyme, the specific antisense recognition 
15 sequence that will target the enzyme to the message is divided so that it comprises 
the two arms of the DNA enzyme, and the DNA enzyme loop is placed between the 
two specific arms. - •, 

Methods of making and administering DNA enzymes can be found, for 
example, in US 61 10462. Similarly, methods of delivery DNA ribozymes in vitro 
20 or in vivo include methods of delivery RNA ribozyme, as outlined in detail above. 
Additionally, one of skill in the art will recognize that, like antisense 
oligonucleotide, DNA enzymes can be optionally modified to improve stability and 
improve resistance to degradation. 

Antisense RNA and DNA, ribozyme, RNAi and triple helix molecules of the 
25 application may be prepared by any method known in the art for the synthesis of 
DNA and RNA molecules. These include techniques for chemically synthesizing 
oligodeoxyribonucleotides and oligoribonucleotides well known in the art such as 
for example solid phase phosphoramidite chemical synthesis. Alternatively, RNA 
molecules may be generated by in vitro and in vivo transcription of DNA sequences 
30 encoding the antisense RNA molecule. Such DNA sequences may be incorporated 
into a wide variety of vectors which incorporate suitable RNA polymerase 
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promoters such as the T7 or SP6 polymerase promoters. Alternatively, antisense 
cDNA constructs that synthesize antisense RNA constitutively or inducibly, 
depending on the promoter used, can be introduced stably into cell lines. Moreover, 
various well-known modifications to nucleic acid molecules may be introduced as a 
5 means of increasing intracellular stability and half-life. Possible modifications 
include but are not limited to the addition of flanking sequences of ribonucleotides 
or deoxyribonucleotides to the 5* and/or 3' ends of the molecule or the use of 
phosphorothioate or 2* O-methyl rather than phosphodiesterase linkages within the 
oligodeoxyribonucleotide backbone. 

10 

9- Drug Screening Assays 

In certain aspects, the present application also provides assays for identifying 
therapeutic agents which either interfere with or promote POSH or POSH-AP 
function. In certain embodiments, agents of the application are antiviral agents, 

15 optionally interfering with viral maturation, and preferably where the virus is a 
retroid virus, an RNA virus and an envelope virus. In certain preferred 
embodiments, an antiviral agent interferes with the ubiquitin ligase catalytic activity 
of POSH (e.g. POSH auto-ubiquitination or transfer to a target protein). In certain 
preferred embodiments, an antiviral agent interferes with the interaction betweeb 

20 POSH and a POSH-AP polypeptide, for example an antiviral agent may disrupt or 
render irreversible the interaction between a POSH polypeptide and POSH-AP 
polypeptide such as an UNC84, an MSTP028, a HERPUD1, another POSH 
polypeptide (as in the case of a POSH dimer, a heterodimer of two different POSH 
polypeptides, homomultimers and heteromultimers); a GTPase (eg. Rac, Racl, Rho, 

25 Ras); an E2 enzyme and ubiquitin, or optionally, a cullin; a clathrin; AP-1; AP-2; an 
HSP70; an HSP90, Bical, Bardl, Nef, PAK1, PAK2, PAK family, Vav, Cdc42, 
PI3K (e.g. p85 or pi 10), Nedd4, src (src family), a Gag, particularly an HIV Gag 
(e.g. pl60), TsglOl, VASP, RNB6, WASP, N-WASP and KIAA0674, Similar to 
Spred-2, as well as, in certain embodiments, proteins known to be associated with 

30 clathrin-coated vesicles and or proteins involved in the protein sorting pathway. In 
further embodiments, agents of the application are anti-apoptotic agents, optionally 
interfering with JNK and/or NF-kB signaling. In yet additional embodiments, 
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agents of the application interfere with the signaling of a GTPase, such as Rac or 
Ras, optionally disrupting the interaction between a POSH polypeptide and a Rac 
protein. In certain embodiments, agents of the application modulate the ubiquitin 
ligase activity of POSH and may be used to treat certain diseases related to ubiquitin 

5 ligase activity. 

In certain embodiments, the application provides assays to identify, optimize 
or otherwise assess agents that increase or decrease a ubiquitin-related activity of a 
POSH polypeptide. Ubiquitin-related activities of POSH polypeptides may include 
the self-ubiquitination activity of a POSH polypeptide, generally involving the 

10 transfer of ubiquitin from an E2 enzyme to the POSH polypeptide, and the 
ubiquitination of a target protein, generally involving the transfer of a ubiquitin from 
a POSH polypeptide to the target protein. In certain embodiments, a POSH activity 
is mediated, at least in part, by a POSH RING domain. 

In certain embodiments, an assay comprises forming a mixture comprising a 

15 POSH polypeptide, an ts2 polypeptide and a source of ubiquitin (which may be the 
E2 polypeptide pre-complexed with ubiquitin). Optionally the mixture comprises an 
El polypeptide and optionally the mixture comprises a target polypeptide. 
Additional components of the mixture may be selected to provide conditions 
consistent with the ubiquitination of the POSH polypeptide. One or more of a 

20 variety of parameters may be detected, such as POSH-ubiquitin conjugates, E2- 
ubiquitin thioesters, free ubiquitin and target polypeptide-ubiquitin complexes. The 
term "detect" is used herein to include a determination of the presence or absence of 
the subject of detection (e.g. POSH-ubiqutin, E2-ubiquitin, etc.), a quantitative 
measure of the amount of the subject of detection, or a mathematical calculation of 

25 the presence, absence or amount of the subject of detection, based on the detection 
of other parameters. The term "detect" includes the situation wherein the subject of 
detection is determined to be absent or below the level of sensitivity. Detection may 
comprise detection of a label (e.g. fluorescent label, radioisotope label, and other 
described below), resolution and identification by size (e.g. SDS-PAGE, mass 

30 spectroscopy), purification and detection, and other methods that, in view of this 
specification, will be available to one of skill in the art For instance, radioisotope 
labeling may be measured by scintillation counting, or by densitometry after 
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exposure to a photographic emulsion, or by using a device such as a 
Phosphorimager. Likewise, densitometry may be used to measure bound ubiquitin 
following a reaction with an enzyme label substrate that produces an opaque product 
when an enzyme label is used. In a preferred embodiment, an assay comprises 
5 detecting the POSH-ubiquitin conjugate. 

In certain embodiments, an assay comprises forming a mixture comprising a 
POSH polypeptide, a target polypeptide and a source of ubiquitin (which may be the 
POSH polypeptide pre-complexed with ubiquitin). Optionally the mixture 
comprises an El and/or E2 polypeptide and optionally the mixture comprises an E2- 

10 ubiquitin thioester. Additional components of the mixture may be selected to 
provide conditions consistent with the ubiquitination of the target polypeptide. One 
or more of a variety of parameters may be detected, such as POSH-ubiquitin 
conjugates and target polypeptide-ubiquitin conjugates. In a preferred embodiment, 
an assay comprises detecting the target polypeptide-ubiquitin conjugate. In another 

15 preferred embodiment, an assay comprises detecting the POSH-ubiquitin conjugate. 

An assay described above may be used in a screening assay to identify agents 
that modulate a ubiquitin-related activity of a POSH polypeptide. A screening assay 
will generally involve adding a test agent to one of the above assays, or any other 
assay designed to assess a ubiquitin-related activily of a POSH polypeptides The 

20 parameters) detected in a screening assay may be compared to a suitable reference. 
A suitable reference may be an assay run previously, in parallel or later that omits 
the test agent. A suitable reference may also be an average of previous 
measurements in the absence of the test agent In general the components of a 
screening assay mixture may be added in any order consistent with the overall 

25 activity to be assessed, but certain variations may be preferred. For example, in 
certain embodiments, it may be desirable to pre-incubate the test agent and the E3 
(e.g. the POSH polypeptide), followed by removing the test agent and addition of 
other components to complete the assay. In this manner, the effects of the agent 
solely on the POSH polypeptide may be assessed. In certain preferred 

30 embodiments, a screening assay for an antiviral agent employs a target polypeptide 
comprising an L domain, and preferably an HIV L domain. 
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In certain embodiments, an assay is performed in a high-throughput format. 
For example, one of the components of a mixture may be affixed to a solid substrate 
and one or more of the other components is labeled. For example, the POSH 
polypeptide may be affixed to a surface, such as a 96-weU plate, and the ubiquitin is 
5 in solution and labeled. An E2 and El are also in solution, and the POSH-ubiquitin 
conjugate formation may be measured by washing die solid surface to remove 
uncomplexed labeled ubiquitin and detecting the ubiquitin that remains bound. 
Other variations may be used. For example, the amount of ubiquitin in solution may 
be detected. In certain embodiments, the formation of ubiquitin complexes may be 

10 measured by an interactive technique, such as FRET, wherein a ubiquitin is labeled 
with a first label and the desired complex partner (e.g. POSH polypeptide or target 
polypeptide) is labeled with a second label, wherein the first and second label 
interact when they come into close proximity to produce an altered signal. In 
FRET, die first and second labels are fluorophores. FRET is described in greater 

15 detail below. The formation of polyubiquitin complexes may be performed by 
mixing two or more pools of differentially labeled ubiquitin that interact upon 
formation of a polyubiqutin (see, e.g. US . Patent Publication 20020042083). High- 
throughput may be achieved by performing an interactive assay, such as FRET, in 
solution as well. In addition, if a polypeptide in the mixture, such as the POSH 

20 polypeptide or target polypeptide, is readily purifiable (e.g. with a specific antibody 
or via a tag such as biotin, FLAG, polyhistidine, etc.), the reaction may be 
performed in solution and the tagged polypeptide rapidly isolated, along with any 
polypeptides, such as ubiquitin, that are associated with the tagged polypeptide. 
Proteins may also be resolved by SDS-PAGE for detection. 

25 In certain embodiments, the ubiquitin is labeled, either directly or indirectly. 

This typically allows for easy and rapid detection and measurement of ligated 
ubiquitin, making the assay useful for high-throughput screening applications. As 
descrived above, certain embodiments may employ one or more tagged or labeled 
proteins. A "tag" is meant to include moieties that facilitate rapid isolation of the 

30 tagged polypeptide. A tag may be used to facilitate attachment of a polypeptide to a 
surface. A "label" is meant to include moieties that facilitate rapid detection of the 
labeled polypeptide. Certain moieties may be used both as a label and a tag (e.g. 
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epitope tags that are readily purified and detected with a well-characterized 
antibody). Biotinylation of polypeptides is well known, for example, a large number 
of biotinylation agents are known, including amine-reactive and thiol-reactive 
agents, for the biotinylation of proteins, nucleic acids, carbohydrates, carboxylic 
5 acids; see chapter 4, Molecular Probes Catalog, Haugland, 6th Ed. 1996, hereby 
incorporated by reference. A biotinylated substrate can be attached to a biotinylated 
component via avidin or streptavidin. Similarly, a large number of haptenylation 
reagents are also known. 

An "El" is a ubiquitin activating enzyme. In a preferred embodiment, El is 
10 capable of transferring ubiquitin to an E2. In a preferred embodiment, El forms a 
high energy thiolester bond with ubiquitin, thereby "activating" the ubiquitin. An 
"E2" is a ubiquitin carrier enzyme (also known as a ubiquitin conjugating enzyme). 
In a preferred embodiment, ubiquitin is transferred from El to E2. In a preferred 
embodiment, the transfer results in a thiolester bond formed between E2 and 
IS ubiquitin. In a preferred embodiment, E2 is capable of transferring ubiquitin to a 
POSH polypeptide. 

In an alternative embodiment, a POSH polypeptide, E2 or target polypeptide 
is bound to a bead, optionally with the assistance of a tag. Following ligation, the 
beads may be separated from the unbound ubiquitin and the bound ubiquitin 

20 measured. In a preferred embodiment, POSH polypeptide is bound to beads and the 
composition used includes labeled ubiquitin. In this embodiment, the beads with 
bound ubiquitin may be separated using a fluorescence-activated cell sorting 
0? ACS) machine- Methods for such use are described in U.S. patent application Ser. 
No. 09/047,119, which is hereby incorporated in its entirety. The amount of bound 

25 ubiquitin can then be measured. 

In a screening assay, the effect of a test agent may be assessed by, for 
example, assessing the effect of the test agent on kinetics, steady-state and/or 
endpoint of the reaction. 

The components of the various assay mixtures provided herein may be 
30 combined in varying amounts. In a preferred embodiment, ubiquitin (or E2 
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complexed ubiquitin) is combined at a final concentration of from 5 to 200 ng per 
100 microliter reaction solution. Optionally El is used at a final concentration of 
from 1 to 50 ng per 100 microliter reaction solution. Optionally E2 is combined at a 
final concentration of 10 "to 100 ng per 100 microliter reaction solution, more 
5 preferably 10-50 ng per 100 microliter reaction solution. In a preferred embodiment, 
POSH polypeptide is combined at a final concentration of from 1 ng to 500 ng per 
100 microliter reaction solution. 

Generally, an assay mixture is prepared so as to favor ubiquitin ligase 
activity and/or ubiquitination acitivty. Generally, this will be physiological 

10 conditions, such as 50 - 200 mM salt (e.g. NaCl, KC1), pH of between 5 and 9, and 
preferably between 6 and 8. Such conditions may be optimized through trial and 
error. Incubations may be performed at any temperature which facilitates optimal 
activity, typically between 4 and 40 degrees C. Incubation periods are selected for 
optimum activity, but may also be optimized to facilitate rapid high through put 

15 screening. Typically between 0.5 and 1.5 hours will be sufficient A variety of other 
reagents may be included in the compositions. These include reagents like salts, 
solvents, buffers, neutral proteins, e.g. albumin, detergents, etc, which may be used 
to facilitate optimal ubiquitination enzyme activity and/or reduce non-specific or 
background interactions. Also reagents that otherwise improve the efficiency of the 

20 assay, such as protease inhibitors, nuclease inhibitors, anti-microbial agents, etc., 
may be used. The compositions will also preferably include adenosine tri-phosphate 
(ATP). The mixture of components may be added in any order that promotes 
ubiquitin ligase activity or optimizes identification of candidate modulator effects. In 
a preferred embodiment, ubiquitin is provided in a reaction buffer solution, followed 

25 by addition of the ubiquitination enzymes. In an alternate preferred embodiment, 
ubiquitin is provided in a reaction buffer solution, a candidate modulator is then 
added, followed by addition of the ubiquitination enzymes. 

In general, a test agent that decreases a POSH ubiquitin-related activity may 
be used to inhibit POSH function in vivo, while a test agent that increases a POSH 
30 ubiquitin-related activity may be used to stimulate POSH function in vivo. Test 
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agent may be modified for use in vivo, e.g. by addition of a hydrophobic moiety, 
such as an ester. 

An additional POSH-AP may be added to a POSH ubiquitination assay to 
assess the effect of the POSH-AP on POSH-mediated ubiquitination and/or to assess 
5 whether the POSH-AP is a target for POSH-mediated ubiquitination. 

Certain embodiments of the application relate to assays for identifying agents 
that bind to a POSH or POSH-AP polypeptide, optionally a particular domain of 
POSH such as an SH3 or RING domain or a particular domain of a POSH-AP, such 
as a BTB/POZ domain of an MSTP028 or a ubiquitin-like domain of a HERPUD1. 

10 In preferred embodiments, a POSH polypeptide is a polypeptide comprising the 
fourth SH3 domain of hPOSH (SEQ ID NO: 30). A wide variety of assays may be 
used for this purpose, including labeled in vitro protein-protein binding assays, 
electrophoretic mobility shift assays, immunoassays for protein binding, and the 
like. The purified protein may also be used for determination of three-dimensional 

15 crystal structure, which can be used for modeling intermolecular interactions and 
design of test agents. In one embodiment, an assay detects agents which inhibit 
interaction of one or more subject POSH polypeptides with a POSH-AP. In another 
embodiment, the assay detects agents which modulate the intrinsic biological 
activity of a POSH polypeptide or POSH complex, such as an enzymatic activity, 

20 binding to other cellular components, cellular compartmentalization, and the like. 

In one aspect, the application provides methods and compositions for the 
identification of compositions that interfere with the function of POSH or POSH-AP 
polypeptides. Given the role of POSH polypeptides in viral production, 
compositions that perturb the formation or stability of the protein-protein 

25 interactions between POSH polypeptides and the proteins that they interact with, 
such as POSH-APs, and particularly POSH complexes comprising a viral protein, 
are candidate pharmaceuticals for the treatment of viral infections. 

While not wishing to be bound to mechanism, it is postulated that POSH 
polypeptides promote the assembly of protein complexes that are important in 

30 release of virions and other biological processes. Complexes of the application may 
include a combination of a POSH polypeptide and a POSH-AP. Exemplary 
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N complexes may comprise one or more of the following: a POSH polypeptide (as in 
the case of a POSH dimer, a heterodiraer of two different POSH, homomultimers 
and heteromultimers); an UNC48, an MSTP028, an HERPUD1, a GTPase (eg. Rac, 
Racl, Rho, Ras); an E2 enzyme; ubiquitin, or optionally, a cullin; a clathrin; AP-1; 

5 AP-2; an HSP70; an HSP90, Brcal, Bardl, Nef, PAK1, PAK2, PAK family, Vav, 
Cdc42, PI3K (e.g. p85 or pi 10), Nedd4, src (src family), TsglOl, VASP, RNB6, 
WASP, N-WASP, a Gag, particularly an HIV Gag (e.g. pl60); and KIAA0674, 
Similar to Spred-2, as well as, in certain embodiments, proteins known to be 
associated with clathrin-coated vesicles and or proteins involved in the protein 

10 sorting pathway. 

The type of complex foimed by a POSH polypeptide will depend upon the 
domains present in the protein. While not intended to be limiting, exemplary 
domains of potential interacting proteins are provided below. A RING domain is 
expected to interact with cullins, E2 enzymes, AP-1, AP-2, and/or a substrate for 

15.. ubiquitylation (e.g. in some instances, a protein comprising a Gag L domain or a 
Gag polypeptide such as Gag-Pol, such as HIV pl60). An SH3 domain may interact 
with Gag L domains and other proteins having the sequence motif P(T/S)AP, 
RXXP(T/S)AP, PXXDY, PXXP, PPXY or RXXPXXP, such as, for example, an 
HIV Gag sequence such as RQGPKEPFR, PFRDY, PTAP and RPEPTAP. 

20 In a preferred assay for an antiviral or antiapoptotic agent, the test agent is 

assessed for its ability to disrupt or inhibit the formation of a complex of a POSH 
polypeptide and a Rac polypeptide, particularly a human Rac polypeptide, such as 
Racl. 

A variety of assay formats will suffice and, in light of the present disclosure, 
25 those not expressly described herein will nevertheless be comprehended by one of 
ordinary skill in the art Assay formats which approximate such conditions as 
formation of protein complexes, enzymatic activity, and even a POSH polypeptide- 
mediated membrane reorganization or vesicle formation activity, may be generated 
in many different forms, and include assays based on cell-free systems, e.g. purified 
30 proteins or cell lysates, as well as cell-based assays which utilize intact cells. 
Simple binding assays can also be used to detect agents which bind to POSH. Such 
binding assays may also identify agents that act by disrupting the interaction 
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between a POSH polypeptide and a POSH interacting protein, or the binding of a 
POSH polypeptide or complex to a substrate. Agents to be tested can be produced, 
for example, by bacteria, yeast or other organisms (e.g. natural products), produced 
chemically (e.g. small molecules, including peptidomimetics), or produced 
5 recombinantly. In a preferred embodiment, the test agent is a small organic 
molecule, e.g., other than a peptide or oligonucleotide, having a molecular weight of 
less than about 2,000 daltons. 

In many drug screening programs which test libraries of compounds and 
natural extracts, high throughput assays are desirable in order to maximize the 

10 number of compounds surveyed in a given period of time. Assays of the present 
application which are performed in cell-free systems, such as may be developed with 
purified or semi-purified proteins or with lysates, are often preferred as "primary" 
screens in that they can be generated to permit rapid development and relatively easy 
detection of an alteration in a molecular target which is mediated by a test 

1 5 compound. Moreover, the effects of cellular toxicity and/or bioavailability of the 
test compound can be generally ignored in the in vitro system, the assay instead 
being focused primarily on the effect of the. drug on the molecular target as may be 
manifest in an alteration of binding affinity with other proteins or changes in 
enzymatic properties of the molecular target. 

20 In preferred in vitro embodiments of the present assay, a reconstituted POSH 

complex comprises a reconstituted mixture of at least semi-purified proteins. By 
semi-purified, it is meant that the proteins utilized in the reconstituted mixture have 
been previously separated from other cellular or viral proteins. For instance, in 
contrast to cell lysates, the proteins involved in POSH complex formation are 

25 present in the mixture to at least 50% purity relative to all other proteins in the 
mixture, and more preferably are present at 90-95% purity. In certain embodiments 
of the subject method, the reconstituted protein mixture is derived by mixing highly 
purified proteins such that the reconstituted mixture substantially lacks other 
proteins (such as of cellular or viral origin) which might interfere with or otherwise 

30 alter the ability to measure POSH complex assembly and/or disassembly. 
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Assaying POSH complexes, in the presence and absence of a candidate 
inhibitor, can be accomplished in any vessel suitable for containing the reactants. 
Examples include microtitre plates, test tubes, and micro-centrifuge tubes. 

In one embodiment of the present application, drug screening assays can be 
5 generated which detect inhibitory agents on the basis of their ability to interfere with 
assembly or stability of the POSH complex. In an exemplary binding assay, the 
compound of interest is contacted with a mixture comprising a POSH polypeptide 
and at least one interacting polypeptide. Detection and quantification of POSH 
complexes provides a means for determining the compound's efficacy at inhibiting 
10 (or potentiating) interaction between the two polypeptides. The efficacy of the 
compound can be assessed by generating dose response curves from data obtained 
using various concentrations of the test compound. Moreover, a control assay can 
also be performed to provide a baseline for comparison. In the control assay, the 
formation of complexes is quantitated in the absence of the test compound. 
15- Complex formation between the POSH polypeptides and a substrate 

polypeptide may be detected by a variety of techniques, many of which are 
effectively described above. For instance, modulation in the formation of complexes 
can be quantitated using, for example, detectably labeled proteins (e.g. radiolabeled, 
fluorescently labeled, or enzymatically labeled), by immunoassay, or by 
20 chromatographic detection. Surface plasmon resonance systems, such as those 
available from Biacore International AB (Uppsala, Sweden), may also be used to 
detect protein-protein interaction 

Often, it will be desirable to immobilize one of the polypeptides to facilitate 
separation of complexes from uncomplexed forms of one of the proteins, as well as 
25 to accommodate automation of the assay. In an illustrative embodiment, a fusion 
protein can be provided which adds a domain that permits the protein to be bound to 
an insoluble matrix. For example, GST-POSH fusion proteins can be adsorbed onto 
glutathione sepharose beads (Sigma Chemical, St Louis, MO) or glutathione 
derivatized microtitre plates, which are then combined with a potential interacting 
30 protein, e.g. an 35S-labeled polypeptide, and the test compound and incubated under 
conditions conducive to complex formation . Following incubation, the beads are 
washed to remove any unbound interacting protein, and the matrix bead-bound 
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radiolabel detennined directly (e.g. beads placed in scintillant), or in the supernatant 
after the complexes are dissociated, e.g. when microtitre plate is used. Alternatively, 
after washing away unbound protein, the complexes can be dissociated from the 
matrix, separated by SDS-PAGE gel, and the level of interacting polypeptide found 
5 in the matrix-bound fraction quantitated from the gel using standard electrophoretic 
techniques. 

In a further embodiment, agents that bind to a POSH or POSH-AP may be 
identified by using an immobilized POSH or POSH-AP. In an illustrative 
embodiment, a fusion protein can be provided which adds a domain that permits the 

10 protein to be bound to an insoluble matrix. For example, GST-POSH fusion 
proteins can be adsorbed onto glutathione sepharose beads (Sigma Chemical, St. 
Louis, MO) or glutathione derivatized microtitre plates, which are then combined 
with a potential labeled binding agent and incubated under conditions conducive to 
binding. Following incubation, the beads are washed to remove any unbound agent, 

15 and the matrix bead-bound label determined directly, or in the supernatant after the 
bound agent is dissociated. 

In yet another embodiment, the POSH polypeptide and potential interacting 
polypeptide can be used to generate an interaction trap assay (see also, U.S. Patent 
NO: 5,283,317; Zervos et aL (1993) Cell 72:223-232; Madura et al. (1993) J Biol 

20 Chem 268:12046-12054; Bartel et al. (1993) Biotechniques 14:920-924; and 
Iwabuchi et al. (1993) Oncogene 8:1693-1696), for subsequently detecting agents 
which disrupt binding of the proteins to one and other. 

In particular, the method makes use of chimeric genes which express hybrid 
proteins. To illustrate, a first hybrid gene comprises the coding sequence for a 

25 DNA-binding domain of a transcriptional activator can be fused in frame to the 
coding sequence for a "bait" protein, e.g., a POSH polypeptide of sufficient length to 
bind to a potential interacting protein. The second hybrid protein encodes a 
transcriptional activation domain fused in frame to a gene encoding a "fish" protein, 
e.g., a potential interacting protein of sufficient length to interact with the. POSH 

30 polypeptide portion of the bait fusion protein. If the bait and fish proteins are able to 
interact, e.g., form a POSH complex, they bring into close proximity the two 
domains of the transcriptional activator. This proximity causes transcription of a 
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reporter gene which is operably linked to a transcriptional regulatory site responsive 
to the transcriptional activator, and expression of the reporter gene can be detected 
and used to score for the interaction of the bait and fish proteins. 

In accordance with the present application, the method includes providing a 

5 host cell, preferably a yeast cell, e.g., Kluyverei lactis, Schizosaccharomyces pombe, 
Ustilago maydis, Saccharomyces cerevisiae, Neurospora crassa, Aspergillus niger, 
Aspergillus nidulans, Pichia pastoris, Candida tropicalis, and Hansenula 
polymorpha, though most preferably S cerevisiae or S. pombe. The host cell 
contains a reporter gene having a binding site for the DNA-binding domain of a 

10 transcriptional activator used in the bait protein, such that the reporter gene 
expresses a detectable gene product when the gene is transcriptionally activated. The 
first chimeric gene may be present in a chromosome of the host cell, or as part of an 
expression vector. Interaction trap assays may also be performed in mammalian and 
bacterial cell types. 

. - 15 The host cell also contains a first chimeric gene which is capable of being 

expressed in the host cell. The gene encodes a chimeric protein, which comprises (i) 
a DNA-binding domain that recognizes the responsive. element on the reporter gene 
• in the host cell, and (ii) a bait protein, such as a POSH polypeptide sequence. 

A second chimeric gene is also provided which is capable of being expressed 
20 in the host cell, and encodes the "fish" fusion protein. In one embodiment, both the 
first and the second chimeric genes are introduced into the host cell in the form of 
plasmids. Preferably, however, the first chimeric gene is present in a chromosome 
of the host cell and the second chimeric gene is introduced into the host cell as part 
of a plasmid. 

25 Preferably, the DNA-binding domain of the first hybrid protein and the 

transcriptional activation domain of the second hybrid protein are derived from 
transcriptional activators having separable DNA-binding and transcriptional 
activation domains. For instance, these separate DNA-binding and transcriptional 
activation domains are known to be found in the yeast GAL4 protein, and are known 

30 to be found in the yeast GCN4 and ADR1 proteins. Many other proteins involved in 
transcription also have separable binding and transcriptional activation domains 
which make them useful for the present application, and include, for example, the 
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LexA and VP 16 proteins. It will be understood that other (substantially) 
transcriptionally-inert DNA-binding domains may be used in the subject constructs; 
such as domains of ACE1, lcl, lac repressor, jun or fos. In another embodiment, the 
DNA-binding domain and the transcriptional activation domain may be from 
5 different proteins. The use of a LexA DNA binding domain provides certain 
advantages. For example, in yeast, the LexA moiety contains no activation function 
and has no known effect on transcription of yeast genes. In addition, use of LexA 
allows control over the sensitivity of the assay to the level of interaction (see, for 
example, the Brent et al. PCT publication WO94/10300). 
10 In preferred embodiments, any enzymatic activity associated with the bait or 

fish proteins is inactivated, e.g., dominant negative or other mutants of a POSH 
polypeptide can be used. 

Continuing with the illustrated example, the POSH polypeptide-mediated 
interaction, if any, between the bait and fish fusion proteins in the host cell, 
15 therefore, causes the activation domain to activate transcription of the reporter gene. 
The method is carried out by introducing the first chimeric gene and the second 
chimeric gene into the host cell, and subjecting that cell to. conditions under which 
the bait and fish fusion proteins and are expressed in sufficient quantity for the 
reporter gene to be activated. The formation of a POSH - POSH-AP complex results 
20 in a detectable signal produced by the expression of the reporter gene. Accordingly, 
the level of formation of a complex in the presence of a test compound and in the 
absence of the test compound can be evaluated by detecting the level of expression 
of the reporter gene in each case. Various reporter constructs may be used in accord 
with the methods of the application and include, for example, reporter genes which 
25 produce such detectable signals as selected from the group consisting of an 
enzymatic signal, a fluorescent signal, a phosphorescent signal and drug resistance. 

One aspect of the present application provides reconstituted protein 
preparations including a POSH polypeptide and one or more interacting 
polypeptides. 

30 In still further embodiments, of the present assay, the POSH complex is 

generated in whole cells, taking advantage of cell culture techniques to support the 
subject assay. For example, as described below, the POSH complex can be 
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constituted in a eukaryotic cell culture system, including mammalian and yeast cells. 
Often it will be desirable to express one or more viral proteins (eg. Gag or Env) in 
such a cell along with a subject POSH polypeptide. It may also be desirable to 
infect the cell with a virus of interest. Advantages to generating the subject assay in 
an intact cell include the ability to detect inhibitors which are functional in an 
environment more closely approximating that which therapeutic use of the inhibitor 
would require, including the ability of the agent to gain entry into the cell. 
Furthermore, certain of the in vivo embodiments of the assay, such as examples 
given below, are amenable to high through-put analysis of candidate agents. 

The components of the POSH complex can be endogenous to the cell 
selected to support the assay. Alternatively, some or all of the components can be 
derived from exogenous sources. For instance, fusion proteins can be introduced 
into the cell by recombinant techniques (such as through the use of an expression 
vector), as well as by microinjecting the fusion protein itself or mRNA encoding the 
fusion protein- - .?-'••■ 

In many embodiments, a cell is manipulated after incubation with a 
candidate agent and assayed for a POSH or POSH-AP activity. In certain 
embodiments a POSH or POSH-AP activity is represented by production of virus 
like particles. As demonstrated herein, an agent that disrupts POSH or POSH-AP 
activity can cause a decrease in the production of virus like particles. Other 
bioassays for POSH or POSH-AP activities may include apoptosis assays (e.g. cell 
survival assays, apoptosis reporter gene assays, etc.) and NF-kB nuclear localization 
assays (see e.g. Tapon et al. (1998) EMBO J. 17: 1395-1404). In certain 
embodiments, POSH or POSH-AP activities may include, without limitation, 
complex formation, ubiquitination and membrane fusion events (eg. release of viral 
buds or fusion of vesicles). POSH complex formation may be assessed by 
immunoprecipitation and analysis of co-immunoprecipiated proteins or affinity 
purification and analysis of co-purified proteins. Fluorescence Resonance Energy 
Transfer (FRET)-based assays may also be used to determine complex formation. 
Fluorescent molecules having the proper emission and excitation spectra that are 
brought into close proximity with one another can exhibit FRET. The fluorescent 
molecules are chosen such that the emission spectrum of one of the molecules (the 
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donor molecule) overlaps with the excitation spectrum of the other molecule (the 
acceptor molecule). The donor molecule is excited by light of appropriate intensity 
within the donor's excitation spectrum. The donor then emits the absorbed energy as 
fluorescent light The fluorescent energy it produces is quenched by the acceptor 
5 molecule. FRET can be manifested as a reduction in the intensity of the fluorescent 
signal from the donor, reduction in the lifetime of its excited state, and/or re- 
emission of fluorescent light at the longer wavelengths (lower energies) 
characteristic of the acceptor. When the fluorescent proteins physically separate, 
FRET effects are diminished or eliminated. (U.S. Patent No. 5,981,200). 
10 For example, a cyan fluorescent protein is excited by light at roughly 425 - 

450 nm wavelength and emits light in the range of 450 - 500 nm. Yellow 
fluorescent protein is excited by light at roughly 500 - 525 nm and emits light at 525 
- 500 nm. If these two proteins are placed in solution, the cyan and yellow 
fluorescence may be separately visualized. However, if these two proteins are 
15 forced into close proximity with each other, the fluorescent properties will be altered 
by FRET. The bluish light emitted by CFP will be absorbed by \FP and re-emitted 
as yellow light. This means that when the proteins are stimulated with light at 
wavelength 450 nm, the cyan emitted light is greatly reduced and the yellow light, 
which is not normally stimulated at this wavelength, is greatly increased. FRET is 
20 typically monitored by measuring the spectrum of emitted light in response to 
stimulation with light in the excitation range of the donor and calculating a ratio 
between the donor-emitted light and the acceptor-emitted light. When the 
dononacceptor emission ratio is high, FRET is not occurring and the two fluorescent 
proteins are not in close proximity. When the donor: acceptor emission ratio is low, 
25 FRET is occurring and the two fluorescent proteins aTe in close proximity. In this 
manner, the interaction between a first and second polypeptide may be measured. 

The occurrence of FRET also causes the fluorescence lifetime of the donor 
fluorescent moiety to decrease. This change in fluorescence lifetime can be 
measured using a technique termed fluorescence lifetime imaging technology 
30 (FLIM) (Verveer et aL (2000) Science 290: 1567-1570; Squire et al. (1999) Jl 
Microsc. 193: 36; Verveer et aL (2000) Biophys. J. 78: 2127). Global analysis 
techniques for analyzing FLIM data have been developed. These algorithms use the 
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understanding that the donor fluorescent moiety exists in only a limited number of 
states each with a distinct fluorescence lifetime. Quantitative maps of each state can 
be generated on a pixel-by-pixel basis. 

To perform FRET-based assays, the POSH polypeptide and the interacting 
5 protein of interest are both fluorescently labeled. Suitable fluorescent labels are, in 
view of this specification, well known in the art. Examples are provided below, but 
suitable fluorescent labels not specifically discussed are also available to those of 
skill in the art. Fluorescent labeling may be accomplished by expressing a 
polypeptide as a fusion protein with a fluorescent protein, for example fluorescent 

10 proteins isolated from jellyfish, corals and other coelenterates. Exemplary 
fluorescent proteins include the many variants of the green fluorescent protein (GFP) 
of Aequoria victoria. Variants may be brighter, dimmer, or have different excitation 
and/or emission spectra. Certain variants are altered such that they no longer appear 
green, and may appear blue, cyan, yellow or red (termed BFP, CFP, YFP and RFP, 

1 5 respectively). Fluorescent proteins may be stably attached to polypeptides through a 
variety of covalent and noncovalent linkages, including, for example, peptide bonds 
(eg. expression as a fusion protein),, chemical cross-linking and biotin-streptavidin 
coupling. For examples of fluorescent proteins, see U.S. Patents 5,625,048; 
5,777,079; 6,066,476; 6,124,128; Prasher et al. (1992) Gene, 1 1 1:229-233; Heim et 

20 al. {1994) Proc.NatL Acad ScL, USA, 91:12501-04; Wardet al. (1982) Photochem. 
PhotobioL, 35:803-808 ; Levine et al. (1982) Comp. Biochem. Physiol, 72B:77-85; 
Tersikh et al. (2000) Science 290: 1585-88. 

Other exemplary fluorescent moieties well known in the art include 
derivatives of fluorescein, benzoxadioazole, coumarin, eosin, Lucifer Yellow, 

25 pyridyloxazole and rhodamine. These and many other exemplary fluorescent 
moieties may be found in the Handbook of Fluorescent Probes and Research 
Chemicals (2000, Molecular Probes, Inc.), along with methodologies for modifying 
polypeptides with such moieties. Exemplary proteins that fluoresce when combined 
with a fluorescent moiety include, yellow fluorescent protein from Vibrio fischeri 

30 (Baldwin et al. (1990) Biochemistry 29:5509-15), peridinin-chlorophyll a binding 
protein from the dinoflagellate Symbiodinium sp. (Morris et al. (1994) Plant 
Molecular Biology 24:673:77) and phycobiliproteins from marine cyanobacteria 
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such as Synechococcus, e.g., phycoerythrin and phycocyanin (Wilbanks et al. (1993) 
J. BioL Chem. 268:1226-35). These proteins require flavins, peridinin-chlorophyll a 
and various phycobilins, respectively, as fluorescent co-factors. 

FRET-based assays may be used in cell-based assays and in cell-free assays. 
5 FRET-based assays are amenable to high-throughput screening methods including 
Fluorescence Activated Cell Sorting and fluorescent scanning of microtiter arrays. 

In a further embodiment, transcript levels may be measured in cells having 
higher or lower levels of POSH or POSH-AP activity in order to identify genes that 
are regulated by POSH or POSH-APs. Promoter regions for such genes (or larger 
10 portions of such genes) may be operatively linked to a reporter gene and used in a 
reporter gene-based assay to detect agents that enhance or diminish POSH- or 
POSH-AP-regulated gene expression. Transcript levels may be determined in any 
way known in the art, such as, for example, Northern blotting, RT-PCR, microarray, 
etc. Increased POSH activity may be achieved, for example, by introducing a strong 
15 POSH expression vector. Decreased. POSH activity may be achieved, for example, 
by RNAi, antisense, ribozyme, gene knockout, etc.* 

In . general, where the screening assay is i binding assay (whether protein- 
<protein binding, agent-protein binding, etc.), one or more of the molecules may be 
joined to a label, where the label can directly or indirectly provide a detectable 
20 signal. Various labels include radioisotopes, fluorescers, chemiluminescers, 
enzymes, specific binding molecules, particles, e.g. magnetic particles, and the like. 
Specific binding molecules include pairs, such as biotin and streptavidin, digoxin 
and antidigoxin etc. For the specific binding members, the complementary member 
would normally be labeled with a molecule that provides for detection, in 
25 accordance with known procedures. 

In further embodiments, the application provides methods for identifying 
targets for therapeutic intervention. A polypeptide that interacts with POSH or 
participates in a POSH-mediated process (such as viral maturation) may be used to 
identify candidate therapeutics. Such targets may be identified by identifying 
30 proteins that associated with POSH (POSH-APs) by, for example, 
immunoprecipitation with an anti-POSH antibody, in silico analysis of high- 
throughput binding data, two-hybrid screens, and other protein-protein interaction 
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assays described herein or otherwise known in the art in view of this disclosure. 
Agents that bind to such targets or disrupt protein-protein interactions thereof, or 
inhibit a biochemical activity thereof may be used in such an assay. Targets that 
may be identified by such approaches include: an UNC84, an MSTP028, an 
5 HERPUD1, a GTPase (eg. Rac, Racl, Rho, Ras); an E2 enzyme, a cullin; a clathrin; 
AP-1; AP-2; an HSP70; an HSP90, Brcal, Bardl, Nef, PAK1, PAK2, PAK family, 
Vav, Cdc42, PI3K (e.g. p85 or pi 10), Nedd4, src (src family), TsglOl, VASP, 
RNB6, WASP, N-WASP, a Gag, particularly an HIV Gag (e.g. pi 60); and 
KIAA0674, Similar to Spred-2, as well as, in certain embodiments, proteins known 

10 to be associated with clathrin-coated vesicles, proteins involved in the protein 
sorting pathway and proteins involved in a Rac signaling pathway. 

A variety of other reagents may be included in the screening assay. These 
include reagents like salts, neutral proteins, e.g. albumin, detergents, etc that are 
used to facilitate optimal protein-protein binding and/or reduce nonspecific or 

15 background interactions. Reagents that improve the efficiency of the assay, such as 
protease inhibitors, nuclease inhibitors, anti- microbial agents, etc. may be used. The 
mixture of components are added in any order that provides , for the requisite 
binding. Incubations are performed at any suitable temperature, typically between 4° 
and 40° C. Incubation periods are selected for optimum activity, but may also be 

20 optimized to facilitate rapid high-throughput screening. 

In certain embodiments, a test agent may be assessed for its ability to perturb 
the localization of a POSH polypeptide, e.g. preventing POSH localization to the 
nucleus and/ or the Golgi network. 

25 10. Methods and Compositions for Treatment of Viral Disorders 

In a further aspect, the application provides methods and compositions for 
treatment of viral disorders, and particularly disorders caused by retroid viruses, 
RNA viruses and/or envelope viruses, including but not limited to retroviruses, 
rhabdoviruses, lentiviruses, and filoviruses. Preferred therapeutics of the application 

30 function by disrupting the biological activity of a POSH polypeptide or POSH 
complex in viral maturation. Certain therapeutics of the application function by 
disrupting the activity of a POSH-AP. 



-94- 



Exemplary therapeutics of the application include nucleic acid therapies such 
as for example RNAi constructs, antisense oligonucleotides, ribozyme, and DNA 
enTymes. Other therapeutics include polypeptides, peptidomimetics, antibodies and 
small molecules. 

5 Antisense therapies of the application include methods of introducing 

antisense nucleic acids to disrupt the expression of POSH polypeptides or proteins 
that are necessary for POSH function, such as certain POSH-APs. 

RNAi therapies include methods of introducing RNAi constructs to 
downregulate the expression of POSH polypeptides or proteins that are necessary for 

1 0 POSH function. Exemplary RNAi therapeutics include any one of SEQ ID Nos: 1 5, 
16,18, 19, 21, 22, 24 and 25. 

Therapeutic polypeptides may be generated by designing polypeptides to 
mimic certain protein domains important in the formation of POSH complexes, such 
as, for example SH3 or RING domains. For example, a polypeptide comprising a 

1 5 POSH SH3 domain such as for example the SH3 domain as set forth in SEQ ID No: 
30 will compete for binding to a POSH SH3 domain and will therefore act to disrupt 
binding of a partner protein. In one embodiment, a binding partner may be a Gag 
polypeptide. In another embodiment, a binding partner may be Rac. In a further 
embodiment, a polypeptide that resembles an L domain may disrupt recruitment of 

20 Gag to the POSH complex. 

In view of the specification, methods for generating antibodies directed to 
epitopes of POSH and POSH-APs are known in the art. Antibodies may be 
introduced into cells by a variety of methods. One exemplary method comprises 
generating a nucleic acid encoding a single chain antibody that is capable of 

25 disrupting a POSH complex. Such a nucleic acid may be conjugated to antibody 
that binds to receptors on the surface of target cells. It is contemplated that in 
certain embodiments, the antibody may target viral proteins that are present on the 
surface of infected cells, and in this way deliver the nucleic acid only to infected 
cells. Once bound to the target cell surface, the antibody is taken up by endocytosis, 

30 and the conjugated nucleic acid is transcribed and translated to produce a single 

chain antibody that interacts with and disrupts the targeted POSH complex. Nucleic 
acids expressing the desired single chain antibody may also be introduced into cells 
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using a variety of more conventional techniques, such as viral transfection (eg. using 
an adenoviral system) or liposome-mediated transfection. 

Small molecules of the application may be identified for their ability to 
modulate the formation of POSH complexes, as described above. 
5 In view of the teachings herein, one of skill in the art will understand that the 

methods and compositions of the application are applicable to a wide range of 
viruses such as for example retroid viruses, RNA viruses, and envelope viruses. In a 
preferred embodiment, the present application is applicable to retroid viruses. In a 
more preferred embodiment, the present application is further applicable to 

10 retroviruses (retroviridae). In another more preferred embodiment, the present 

application is applicable to lentivirus, including primate lentivirus group. In a most 
preferred embodiment, the present application is applicable to Human 
Immunodeficiency virus (HTV), Human Immunodeficiency virus type-1 (HIV-1), 
Hepatitis B Virus (HBV) and Human T-cell Leukemia Virus (HTLV). 

1 5 While not intended to be -limiting, relevant retroviruses include: C-type . 

retrovirus which causes lymphosarcoma in Northern Pike, the C-type retrovirus 
which infects mink, the caprine lentivirus which.infects sheep, the Equine Infectious 
Anemia Virus (EIAV), the C-type retrovirus which infects pigs, the Avian Leukosis 
Sarcoma Virus (ALSV), the Feline Leukemia Virus (FeLV), the Feline Aids Virus, 

20 the Bovine Leukemia Virus (BLV), the Simian Leukemia Virus (SLV), the Simian 
Immuno-deficiency Virus (SIV), the Human T-cell Leukemia Virus type-I (HTLV- 
I), the Human T-cell Leukemia Virus type-II (HTLV-II), Human Immunodeficiency 
virus type-2 (HTV-2) and Human Immunodeficiency virus type-1 (HIV-1). 
Hie method and compositions of the present application are further 

25 applicable to RNA viruses, including ssRNA negative-strand viruses and ssRNA 
positive-strand viruses. The ssRNA positive-strand viruses include Hepatitis C 
Virus (HCV). In a preferred embodiment, the present application is applicable to 
mononegavirales, including filoviruses. Filoviruses further include Ebola viruses 
and Marburg viruses. 

30 Other RNA viruses include picornaviruses such as enterovirus, poliovirus, 

coxsackievirus and hepatitis A virus, the calici viruses, including Norwalk-like 
viruses, the rhabdoviruses, including rabies virus, the togaviruses including 
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alphaviruses, Semliki Forest virus, denguevirus, yellow fever virus and rubella virus, 
the orthomyxoviruses, including Type A, B, and C influenza viruses, the 
bunyaviruses, including the Rift Valley fever virus and the hantavirus, the 
filoviruses such as Ebola virus and Marburg virus, and the paramyxoviruses, 
5 including mumps virus and measles virus. Additional viruses that may be treated 
include herpes viruses. 

11. Effective Dose 

Toxicity and therapeutic efficacy of such compounds can be determined by 

10 standard pharmaceutical procedures in cell cultures or experimental animals, e.g., 
for determining The Ld50 (The Dose Lethal To 50% Of The Population) And The 
Ed50 (the dose therapeutically effective in 50% of the population). The dose ratio 
between toxic and therapeutic effects is the therapeutic index and it can be expressed 
as the ratio LD50/ED50. Compounds which exhibit large therapeutic induces are 

15 : preferred. While compounds that exhibit toxic side effects may lie used, care should 
be taken to design a delivery system that targets such compounds to the site of 
affected tissue in order to 'minimize potential damage to uninfected cells and, 
thereby, reduce side effects. 

The data obtained from the cell culture assays and animal studies can be used 

20 in formulating a range of dosage for use in humans. The dosage of such compounds 
lies preferably within a range of circulating concentrations that include the ED50 
with little or no toxicity. The dosage may vary within this range depending upon the 
dosage form employed and the route of administration utilized. For any compound 
used in the method of the application, the therapeutically effective dose can be 

25 estimated initially from cell culture assays. A dose may be formulated in animal 
models to achieve a circulating plasma concentration range that includes the IC50 
(i.e., the concentration of the test compound which achieves a half-maximal 
inhibition of symptoms) as determined in cell culture. Such information can be used 
to more accurately determine useful doses in humans. Levels in plasma may be 

30 measured, for example, by high performance liquid chromatography. 

12. Formulation and Use 
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Pharmaceutical compositions for use in accordance with the present 
application may be formulated in conventional manner using one or more 
physiologically acceptable carriers or excipients. Thus, the compounds and their 
physiologically acceptable salts and solvates may be formulated for administration 

5 by, for example, injection, inhalation or insufflation (either through the mouth or the 
nose) or oral, buccal, parenteral or rectal administration. 

An exemplary composition of the application comprises an RNAi mixed 
with a delivery system, such as a liposome system, and optionally including an 
acceptable excipient. In a preferred embodiment, the composition is formulated for 

10 topical administration for, e.g. herpes virus infections. 

For such therapy, the compounds of the application can be formulated for a 
variety of loads of administration, including systemic and topical or localized 
administration. Techniques and formulations generally may be found in 
Remmington's Pharmaceutical Sciences, Meade Publishing Co., Easton, PA. For 
. 1 5 systemic administration, injection is» preferred, including intramuscular, intravenous, 
intraperitoneal, and subcutaneous.. For injection, the compounds of the application 
can be formulated , in liquid solutions, preferably in. physiologically compatible 
buffers such as Hank's solution or Ringer's solution. In addition, the compounds 
may be formulated in solid form and redissolved or suspended immediately prior to 

20 use. Lyophilized forms are also included. 

For oral administration, the pharmaceutical compositions may take the form 
of, for example, tablets or capsules prepared by conventional means with 
pharmaceutical^ acceptable excipients such as binding agents (e.g., pregelatinised 
maize starch, polyvinylpyrrolidone or hydroxypropyl methylcellulose); fillers (e.g., 

25 lactose, microcrystalline cellulose or calcium hydrogen phosphate); lubricants (e.g., 
magnesium stearate, talc or silica); disintegrants (e.g., potato starch or sodium starch 
glycolate); or wetting agents (e.g., sodium lauryl sulphate). The tablets may be 
coated by methods well known in the art. Liquid preparations for oral 
administration may take the foim of, for example, solutions, syrups or suspensions, 

30 or they may be presented as a dry product for constitution with water or other 
suitable vehicle before use. Such liquid preparations may be prepared by 
conventional means with pharmaceutically acceptable additives such as suspending 
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agents (e.g., sorbitol syrup, cellulose derivatives or hydrogenated edible fats); 
emulsifying agents (e.g., lecithin or acacia); non-aqueous vehicles (e.g., ationd oil, 
oily esters, ethyl alcohol or fractionated vegetable oils); and preservatives (e.g., 
methyl or propyl-p-hydroxybenzoates or sorbic acid). The preparations may also 
5 contain buffer salts, flavoring, coloring and sweetening agents as appropriate. 

Preparations for oral administration may be suitably formulated to give 
controlled release of the active compound. For buccal administration the 
compositions may take the form of tablets or lozenges formulated in conventional 
manner. For administration by inhalation, the compounds for use according to the 
10 present application are conveniently delivered in the form of an aerosol spray 
presentation from pressurized packs or a nebuliser, with the use of a suitable 
propellant, e.g., dichlorodifluoromethane, tricMorofluoromethane, 
dichlorotetrafluoroethane, carbon dioxide or other suitable gas. In the case of a 
pressurized aerosol the dosage unit may be determined by providing a valve to 
15 deliver a metered amount. Capsules and cartridges of e.g., gelatin for use in an 
inhaler or insufflator may be formulated containing a powder mix of the compound 
and a suitable powder base such as lactose or starch. . 

The compounds may be formulated for parenteral administration by 
injection, e.g., by bolus injection or continuous infusion. Formulations for injection 
20 may be presented in unit dosage form, e.g., in ampoules or in multi-dose containers, 
with an added preservative. The compositions may take such forms as suspensions, 
solutions or emulsions in oily or aqueous vehicles, and may contain formulatory 
agents such as suspending, stabilizing and/or dispersing agents. Alternatively, the 
active ingredient may be in powder form for constitution with a suitable vehicle, 
25 e.g., sterile pyrogen-free water, before use. 

The compounds may also be formulated in rectal compositions such as 
suppositories or retention enemas, e.g., containing conventional suppository bases 
such as cocoa butter or other glycerides. 

In addition to the formulations described previously, the compounds may 
30 also be formulated as a depot preparation. Such long acting formulations may be 
administered by implantation (for example subcutaneously or intramuscularly) or by 
intramuscular injection. Thus, for example, the compounds may be formulated with 



-99- 



suitable polymeric or hydrophobic materials (for example as an emulsion in an 
acceptable oil) or ion exchange resins, or as sparingly soluble derivatives, for 
example, as a sparingly soluble salt. 

Systemic administration can also be by transmucosal or transdermal means. 
5 For transmucosal or transdermal administration, penetrants appropriate to the barrier 
to be permeated are used in the formulation. Such penetrants are generally known in 
the art, and include, for example, for transmucosal administration bile salts and 
fusidic acid derivatives, in addition, detergents may be used to facilitate permeation. 
Transmucosal administration may be through nasal sprays or using suppositories. 

10 For topical administration, the oligomers of the application are formulated into 
ointments, salves, gels, or creams as generally known in the art. A wash solution 
can be used locally to treat an injury or inflammation to accelerate healing. 

The compositions may, if desired, be presented in a pack or dispenser device 
which may contain one or more unit dosage forms containing the active ingredient 

15 The pack may for example comprise. metal or plastic foil; such as a blister pack. The 
pack or dispenser device may be accompanied by instructions for administration. 

For therapies involving the administration of nucleic acids,, the oligomers of 
the application can be formulated for a variety of modes of administration, including 
systemic and topical or localized administration. Techniques and formulations 

20 generally may be found in Remmington's Pharmaceutical Sciences, Meade 
Publishing Co., Easton, PA. For systemic administration, injection is preferred, 
including intramuscular, intravenous, intraperitoneal, intranodal, and subcutaneous 
for injection, the oligomers of the application can be formulated in liquid solutions, 
preferably in physiologically compatible buffers such as Hank's solution or Ringer's 

25 solution. In addition, the oligomers may be formulated in solid form and 
redissolved or suspended immediately prior to use. Lyophilized forms are also 
included. 

Systemic administration can also be by transmucosal or transdermal means,, 
or the compounds can be administered orally. For transmucosal or transdermal 
30 administration, penetrants appropriate to the barrier to be permeated are used in the 
formulation. Such penetrants are generally known in the art, and include, for 
example, for transmucosal administration bile salts and fusidic acid derivatives. In 
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addition, detergents may be used to facilitate permeation. Transmucosal 
administration may be through nasal sprays or using suppositories. For oral 
administration, the oligomers are formulated into conventional oral administration 
forms such as capsules, tablets, and tonics. For topical administration, the oligomers 
of the application are formulated into ointments, salves, gels, or creams as generally 
known in the art. 

EXEMPLIFICATION 

The application now being generally described, it will be more readily 
understood by reference to the following examples, which are included merely for 
purposes of illustration of certain aspects and embodiments of the present 
application, and are not intended to limit the application. 

EXAMPLES 

Role of PO SH in virus-like particle (VLP1 budding 

1. Objective: * . . 

Use RNAi to inhibit POSH gene expression and compare the efficiency of viral 
budding and GAG expression and processing in treated and untreated cells. 

2. Study Plan: 

HeLa SS-6 cells are transfected with mRNA-specific RNAi in order to knockdown 
the target proteins. Since maximal reduction of target protein by RNAi is achieved 
after 48 hours, cells are transfected twice - first to reduce target mRNAs, and 
subsequently to express the viral Gag protein. The second transfection is performed 
with pNLenv (plasmid that encodes HIV) and with low amounts of RNAi to 
maintain the knockdown of target protein during the time of gag expression and 
budding of VLPs. Reduction in mRNA levels due to RNAi effect is verified by RT- 
PCR amplification of target mRNA. 

3. Methods, Materials, Solutions 
a. Methods 

i. Transfections according to manufacturer's protocol and as described in 
procedure. 



-101- 



ii. Protein determined by Bradford assay. 

in. SDS-PAGE in Hoeffer miniVE electrophoresis system: Transfer in Bio- 
Rad mini- protean II wet transfer system. Blots visualized using 

Typhoon system, and ImageQuant software 

(ABbiotech) 
b. Materials 



Material 


Manufacturer . 


Catalog # 


Batch # 


Lipofectamine 2000 
(LF2000) 


Life Technologies 


11668-019 


1112496 


OptiMEM 


Life Technologies 


31985-047 


3063119 


RNAi Lamin A/C 


Self 


13 




RNAi TSG101 688 


Self 


65 




RNAi Posh 524 


Self 


81 




plenvllPTAP. 


Self 


148 




plenvll ATAP 


Self 


149 




Anti-p24 polyclonal 
antibody 


Seramun 




A-0236/5- 
10-01 


Anti-Rabbit Cy5 
conjugated antibody 


Jackson 


144-175-115 


48715 


10% acrylamide Tris- 
Glycine SDS-PAGE gel 


Life Technologies 


NP0321 


1081371 


Nitrocellulose 
membrane 


Schleicher & 
Schuell 


401353 


BA-83 


NuPAGE 20X transfer 
buffer 


Life Technologies 


NP0006-1 


224365 


0.45jim filter 


Schleicher & 
Schuell 


10462100 


CS.1018-1 



c. Solutions 



Lysis Buffer Compound 



Concentration 
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Tris-HClpH7.6 


50mM 


MgCl 2 


15mM 


NaCI 


150mM 


Glycerol 


10% 


EDTA 


ImM 


EGTA 


ImM 




ASB-14 (add immediately 
before use) 


1% 


6X Sample 
Buffer 


Tris-HCl, pH=6.8 


1M 


Glycerol 


30% 


SDS 


10% 


DTT 


9.3% 


Bromophenol Blue 


0.012% 


TBS-T 

...... 


Tris pH=7.6 


20mM 


NaCI 


137mM 


Tween-20 


0.1% 



Procedure 
a. Schedule 



Day 


1 


2 


3 


4 


5 


Plate 


Transfection 


Passage 


Transfection II 


Extract RNA 


cells 


I 


cells 


(RNAi and 


for RT-PCR 




(RNAi only) 


(1:3) 


pNlenv) 


(post 








(12:00, PM) 


transfection) 








Extract RNA for 


Harvest VLPs 








RT-PCR 


and cells 








(pre-transfection) 





b. Dayl 
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Plate HeLa SS-6 cells in 6-well plates (35mm wells) at concentration of 5 XI 0 5 
cells/well, 
c. Day 2 

2 hours before transfection replace growth medium with 2 ml growth medium 
without antibiotics. 

Transfection I: 

RNAi A B 
RNAI [nM] p() ^ M1 oPti'MEM LF2000 mix 

Reaction RNAi name TAGDA# Reactions pi (pi) (pi) 



1 


Lamin A/C 


13 


2 


50 


12.5 


500 500 


2 


Lamin A/C 


13 


1 


50 


6.25 


250 250 


3 


TSG101 688 


65 


2 


20 


5 


500 500 


5 


Posh 524 


81 


2 


50 


12.5 


500 500 



Transfections: • * 

Prepare LF2000 mix, : 250jil OptiMEM + 5 jul LF2000 for each reaction. Mix by 
1 0 inversion, 5 times. Incubate .5 minutes at room temperature. 

Prepare RNA dilution in OptiMEM (Table 1, column A). Add LF2000 mix 

dropwise to diluted RNA (Table 1, column B). Mix by gentle vortex. Incubate at 

room temperature 25 minutes, covered with aluminum foil. 

Add 500^1 transfection mixture to cells dropwise and mix by rocking side to 
15 side. 

Incubate overnight. 

d. Day 3 

Split 1:3 after 24 hours. (Plate 4 wells for each reaction, except reaction 2 which 
is plated into 3 wells.) 
20 e. Day 4 

2 hours pre-transfection replace medium with DMEM growth medium without 
antibiotics. 

Transfection II 

A B C D 
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RNAi 









Plasmid 


[20MM]for 










Plasmid 


for 2.4 jjg 


10nM 


OPtiMEM LF2000 


Reaction RNAi name TAGDA# Plasmid Reactions (pg/pl) 


(Ml) 


(MD 


(MO 


(Ml) 


1 LaminA/C 13 


PTAP 


3 


3.4 


3.75 


750 


750 


2 LamihA/C 13 


ATAP 


3 


2.5 


3.75 


750 


750 


3 TSG101 688 65 


PTAP 


3 


3.4 


3.75 


750 


750 


5 Posh 524 81 


PTAP. 


3 


3.4 


3.75 


750 


750 



Prepare LF2000 mix: 250jil OptiMEM + 5 *il LF2000 for each reaction. Mix by 
inversion, 5 times. Incubate 5 minutes at room temperature. 
Prepare RNA+DNA diluted in OptiMEM (Transfection II, A+B+C) 
5 Add LF2000 mix (Transfection II, D) to diluted RNA+DNA dropwise, mix by 

gentle vortex, and incubate lh while protected from light with aluminum foil. 
Add LF2000 and DNA+RNA to cells, SOO^l/well, mix by gentle rocking and 
incubate overnight. 

f. Day 5 

10 Collect samples for VLP assay (approximately 24 hours post-transfection) by the 

following procedure (cells from one well from each sample is taken for KNA 
assay, by RT-PCR). 

g. Cell Extracts 

i. Pellet floating cells by centrifugation (5min, 3000rpm at 4°C), save 
IS supernatant (continue with supernatant immediately to step h), scrape 

remaining cells in the medium which remains in the well, add to the 
corresponding floating cell pellet and centrifuge for 5 minutes, 1800rpm at 
4°C 

ii. Wash cell pellet twice with ice-cold PBS. 

20 iii. Resuspend cell pellet in lOOjil lysis buffer and incubate 20 minutes on 

ice. 

iv. Centrifuge at 14,000rpm for 1 Smin. Transfer supernatant to a clean tube. 
This is the cell extract 

v. Prepare 10m1 of cell extract samples for SDS-PAGE by adding SDS- 
25 PAGE sample buffer to IX, and boiling for 1 0 minutes. Remove an aliquot 
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of the remaining sample for protein determination to verify total initial 
starting material. Save remaining cell extract at -80 °C 
h. Purification of VLPs from cell media 

i. Filter the supernatant from step g through a 0.45m filter. 
5 ii. Centrifuge supernatant at 14,Q00rpm at 4°C for at least 2h. 

iii. Aspirate supernatant carefully. 

iv. Re-suspend VLP pellet in hot (100°C warmed for 10 min at least) IX 
sample buffer. 

v. Boil samples for 10 minutes, 100°C. 
10 i. Western Blot analysis 

i. Run all samples from stages A and B on Tris-Glycine SDS-PAGE 1 0% 
(120V for 1.5h.). 

ii. Transfer samples to nitrocellulose membrane (65V for 1 .5h.). 

iii. Stain membrane with ponceau S solution. 

15 iv. Block with 10% low fat milk in TBS-T for Ih; 

v. Incubate with anti p24 rabbit 1 :500 in TBS-T o/n. 

vi. Wash 3 times with TBS-T for 7min each wash. 

vii. Incubate with secondary antibody anti rabbit cy5 1 :500 for 30min. 

viii. Wash five times for lOmin in TBS-T 

20 be. View in Typhoon gel imaging system (Molecular Dynamics/APBiotech) 

for fluorescence signal. 
Results are shown in Figures 1 1-13. 

2. Exemplary POSH RT-PCR primers and siRNA duplexes 

25 RT-PCR primers 





Name 


Position 


Sequence 


Sense primer 


POSH=271 


271 


5 , C^TGCCTTGCCAGCATAC3 , (SEQIDNO:12) 


Anti-sense 
primer 


POSH=926c 


926C 


5 , CTGCCAGCATTCCTTCAG3 , (SEQIDNO:13) 



siRNA duplexes: 

siRNANo: 153 

siRNA Name: POSH-230 
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Position in mRNA 
TaTget sequence: 
siRNA sense strand: 
siRNA anti-sense strand: 



426-446 

5' AACAGAGGCCTTGGAAACCTG 3' 

5' dTdTCAGAGGCCUUGGAAACCUG 3* 

5'dTdTCAGGUUUCCAAGGCCUCUG 3' 



SEQ ID NO; 14 
SEQ ID NO: 15 
SEQ ID NO: 16 



10 



siRNA No: 
siRNA Name: 
Position in mRNA 
Target sequence: 
siRNA sense strand: 
siRNA anti-sense strand: 



155 

POSH-442 
638-658 

5' AAAGAGCCTGGAGACCTTAAA 3' 

5* ddTdTAGAGCCUGGAGACCUUAAA 3' 

5' ddTdTUUUAAGGUCUCCAGGCUCUS* 



SEQ ID NO: 17 
SEQ ID NO: 18 
SEQ ID NO: 19 



siRNA No: 
siRNA Name: 
1 5 Position in mRNA 
Target sequence: 
siRNA sense strand: 
siRNA anti-sense strand: 



157 

POSH-U111 
2973-2993 

5' AAGGATTGGTATGtGACTCTG 3' 
5' dTdTGGAUUGGUAUGUGACUCUG 3' 
5' dTdTCAGAGUCACAUACCAAUCC 3' 



SEQ ID NO: 20 
SEQ ID NO: 21 
SEQ ID NO: 22 



20 siRNA No: 

siRNA Name: 
Position in mRNA 
Target sequence: 
siRNA sense strand: 

25 siRNA anti-sense strand: 



159 

POSH-U410 
3272-3292 

5' AAGCTGGATTATCTCCTGTTG 3' 
5' ddTdTGCUGGAUUAUCUCCUGUUG 3' 
5' ddTdTCAACAGGAGAUAAUCCAGC 3' 



SEQ ID NO: 23 
SEQ ID NO; 24 
SEQ ID NO: 25 



siRNA No.: 
siRNA Name: 
Position in mRNA: 
30 Target sequence: 
NO: 36 



187 

POSH-control 

None. Reverse to #153 

5* AAGTCCAAAGGTTCCGGAGAC 3' 



SEQ ID 
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3. Effects of POSH RNAi on HIV Release: Kinetics 

Al. Transfections 

1 . One day before transfection plate cells at a concentration of 5x1 0 6 cell/well 
5 in 15cm plates. 

2. Two hours before transfection, replace cell media to 20ml complete DMEM 
without antibiotics. 

3. DNA dilution: for each transfection dilute 62.5pl RNAi in 2.5ml OptiMEM 
according to the table below. RNAi stock is 20pM (recommended 

10 concentration: 50nM, dilution in total medium amount 1 :400). 

4. LF 2000 dilution: for each transfection dilute 50jal lipofectamine 2000 
reagent in 2.5ml OptiMEM. 

5. Incubate diluted RNAi and LF 2000 for 5 minutes at RT. 

6. Mix the diluted RNAi with diluted LF2000 and incubated for 20-25 minutes 
15 atRT. 

7. Add the mixure to the cells (drop wise) and incubate for 24 hours at 37°C in 
CO2 incubator, ' ' \ 

8. One day after RNAi transfection split cells (in complete MEM medium to 2 
15cm plate and 1 well in a 6 wells plate) 

20 9. One day after cells split perform HIV transfection according to SP 30-012- 

01. 

10. 6 hours after HIV transfection replace medium to complete MEM medium. 



25 



* It is important to perform RT-PCR for Posh to assure complete knockdown. 



A2. Total RNA purification. 



1 . One day after transfection, wash cells twice with sterile PBS. 

2. Scrape cells in 2.3ml/200^1 (for 15cm plate/1 well of a 6 wells plate) Tri 
reagent (with sterile scrapers) and freeze in -70°C (RNA purification and RT- 

30 PCR will be done by molecular biology unit) rack no. A 1 6 - samples for RT. 

J Treatment j Chase time | Fraction | Labeling | 
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(hours) 






Control=WT 


1 


Cells 

VLP j 


Al 

Al V 




2 


Cells 
VLP 


A2 
A2V 




3 


Cells 
VLP 


A3 
A3 V 




4 


Cells 
VLP 


A4 
A4V 




5 


Cells 
VLP 


A5 
A5V 


Posh+WT 


1 


Cells 
VLP 


Bl 

BlV 




2 


Cells 
VLP 


B2 
B2 V 




3 


Cells 
VLP 


B3 
B3 V 




4 


Cells 
VLP 


B4 
B4V 




5 


Cells 
VLP 


B5 
B5 V 



B* Labeling 

1 . Take out starvation medium, thaw and place at 37°C. 

2. Scrape cells in growth medium and transfer gently into IS ml conical tube. 
5 3. Centrifuge to pellet cells at 1 800rpm for 5 minutes at room temperature. 

4. Aspirate supernatant and let tube stand for 10 sec. Remove the rest of the 
supernatant with a 200jil pipetman. 
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5. Gently add 10ml warm starvation medium and resuspend carefully with a 
10ml pipette, up and down, just turning may not resolve the cell pellet). 

6. Transfer cells to 10ml tube and place in the incubator for 60 minutes. Set an 
Eppendorf thermo mixer to 37°C. 

7. Centrifuge to pellet cells at 1 800rpm for 5 minutes at room temperature. 

8. Aspirate supernatant and let tube stand for 10 sec. Remove the rest of the 
supernatant with a 200 pi pipetman. 

9. Cut a 200pl tip from the end and resuspend cells (~ 1.5 10 7 cells in 150 pi 
RPMI without Met, but try not to go over 250 jul if you have more cells) 
gently in 150 pi starvation medium. Transfer cells to an Eppendorf tube and 
place in the thermo mixer. Wait 10 sec and transfer the rest of the cells from 
the 10 ml tube to the Eppendorf tube, if necessary add another 50 pi to splash 
the rest of the cells out (all specimens should have the same volume of 
labeling reaction!). 

10. Pulse : Add 50 jil of 35 S-methionine (specific activity 14.2 pCi/^il), tightly 
cup tubes and place in thermo mixer. Set the mixing speed to the lowest 
possible (700 rpm) and incubate for 25 minutes. 

11. Stop the pulse by adding 1ml ice-cold chase/stop medium. Shake tube very 
gently three times and pellet cells at 6000rpm for 6 sec. 

12. Remove supernatant with a 1ml tip. Add gently 1ml ice-cold chase/stop 
medium to the pelleted cells and invert gently to resuspend. 

13. Chase : Transfer all tubes to the thermo mixer and incubate for the required 
chase time (830:1,2,3,4 and 5 hours; 828: 3 hours only). At the end of total 
chase time, place tubes on ice, add 1ml ice-cold chase/stop and pellet cells 
for 1 minute at 14,000 rpm. Remove supernatant and transfer supernatant to 
a second eppendorf tube. The cell pellet freeze at -80°C, until all tubes are 
ready. 

14. Centrifuge supernatants for 2 hours at 14,000rpm, 4°C. Remove the 
supernatant very gently, leave 20 |xl in the tube (labeled as V) and freeze at - 
80°C until the end of the time course. 

*** All steps are done on ice with ice-cold buffers 
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15. When the time course is over, remove all tubes form -80°C. Lyse VLP 
pellet (from step 14) and cell pellet (step 13) by adding 500 *il of lysis buffer 
(see solutions), resuspend well by pipeting up and down three times. 
Incubate on ice for 15 minutes, and spin in an eppendorf centrifuge for 15 

5 minutes at 4°C, 14,000 ipm. Remove supernatant to a fresh tube, discard 

pellet. 

16. Perform IP with anti-p24 sheep for all samples. 

C. Immunopercipitation 

10 1. Preclearing ; add to all samples 15jil ImmunoPure PlusG (Pierce). Rotate for 1 
hour at 4°C in a cycler, spin 5 min at 4°C, and transfer to a new tube for IP. 

2. Add to all samples 20\xl of p24-protein G conjugated beads and incubate 4 
hours in a cycler at 4°C. 

3. Post immunqprecipitations, transfer all immunoprecipitations to a fresh tube. 
15 4. Wash beads once with high salt buffer,, once with medium salt buffer and once 

with low salt buffer. After each spin don't remove all solution, but leave 50 jxl 
solution on the beads. After the last spin remove supernatant carefully with a 
loading tip and leave —10 fil solution. 
5. Add to each tube 20 »J 2x SDS sample buffer. Heat to 70°C for 10 minutes. 
20 6. Samples were separated on 10% SDS-PAGE. 

7. Fix gel in 25% ethanol and 10% acetic acid for 15 minutes. 

8. Pour off the fixation solution and soak gels in Amplify solution (NAMP 100 
Amersham) for 15 minutes. 

9. Dry gels on warm plate (60-80 °C) under vacuum. 
25 1 0. Expose gels to screen for 2 hours and scan. 

4. Effect of siRNA against human POSH on production of infectious virus from 
HeLa cells 

The following plan is according to the standard siRNA transfection protocol: 
30 Plan: 

Days 
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i 



2 



3 



4 



5.5 



Harvest virus 



HeLa cells 



Plate 



Transfect RN Ai 



Transfect siRN A + And start infectivity assay, 
Split high (1 :3) fflV-l NL4 ^ take samples for Western blot 



As siRNA the following was used: 

- # 13 (= Lamin control) 

- #153 (= human POSH Ub ligase) 

5 

day 3 (one days following first siRNA transfections): 

- HeLa cells were harvested, split again 1:3 in fresh DMEM and seeded in T75 
flasks. 

day 4: 

10 - medium was removed, fresh DMEM medium was added and cells were transfected 
again with siRNA in combination with one of the following HIV-1 expression 
. . plasmids: 

-Env- (HIV-lNL.if-3 (Adachi et al., 1986) with deletion in env gene, does not allow the 
expression of Env but other HTV-1 proteins), and VSV-G (CMV driven expression 
15 vector for vesicular stomatitis virus G protein). This combination allows single 
round infection as progeny viruses following infections with VSV-G pseudotyped 
viruses are free of Env glycoproteins. 

- Env+ (HTV-1 NL 4_3 wild type) for multiple round of infections . 

Note: Both construct Env- and Env-t- contained EGFP were cloned into the nef open 
20 reading frame of HIV-1 NL4-3- This way all cells transfected or infected cells with 
active virus gene expression can be detected by autoflourescence (based on T. 
Fukunori et al. 2000, Lenardo et al., 2002) 

For control one T25 flaks was transfected with GFP-N1 (a CMV expression vector 
for plain GFP). 



25 day 5: 

- the transfection efficiency was estimated by counting fluorescent cells in the GFP- 
Nl transfected culture using FACS analysis, 
day 5.5: 
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- 36 hrs after second transfection, viius was harvested. Virus stocks were prepared 
as follows: HeLa cells were scraped and virus-containing supematants were 
clarified by centrifugation (1,000 x g, 5 min) and filtered through a 0.45 pm-pore- 
size filter to remove residual cells and debris. Stocks were aliquoted and frozen at - 
80°C. For biochemical analyses virions from aliquots of supematants were pelleted 
(99 min, 14,000 rpm, 4°C) and lysed (according to Ott et al., 2002). Samples of cell 
and virus fractions were analyzed by Western blot using anti-CA antibodies. 

- For infectivity assay serial dilutions of virus stocks were prepared in RPMI 
medium and used to infect Jurkat cells. 

- 3 days post infection the percentage of infected cells in parallel cultures was 
estimated by FACS analyses. Each infection experiments were set up in 3 parallel 
cultures in 96 well plates (based on Bolton et al., 2002). 

- For control, one culture was incubated with cell culture supernatant form cell 
transfected with GFP-N1. No fluorescent cells were detected attesting for absence 
of unspecific staining of cells with GFP from the virus producer ceils. 

Remark: During virus production no toxic effect (other than some HIV- 
related cytopathic effect) was observed in human POSH transfected cultures when 
compared to the Lamin transfected culture^ 

Results, shown in Figure 19, demonstrate that knocking down POSH results 
in four logs reduction of HIV 1 infectivity. 

5. Effect of knocking do wn human POSH on PTAP- and PPEY-mediated viral 
release 

On the first day of the experiment, HeLa SS6 cells were plated in a 6-welI 
plate at 5 XI 0 5 cells/well. The cells were transfected on the second day with 
LF2000 reagent and with the following RNAi*s: Lamin, TSG101 and hPOSH 
(50nM/well). On the third day, cells were split 1:2. The cells were transfected again 
on the fourth day (with LF2000 reagent) with a supporting dose (lOnM/well) of the 
same RNAi's and DNA (lug/well): Lamin with PTAP, Lamin with ATAP, Lamin 
with PPEY, TSG101 with PTAP, TSG101 with PPEY, TSG101 with ATAP, 
hPOSH with PTAP, hPOSH with PPEY, and hPOSH with ATAP. 24 hours later 
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centrifuge supernatant for VLPs and extract protein from cells. Load all samples to a 
12% tris-glycine gel, transfer to a nitrocellulose membrane and incubate with <xp24. 

Results, shown in Figure 20, demonstrate that the inhibition effect of viral 
release of knocking down human POSH is independent of specifice HIV L-domain 
5 motif. In other words, human POSH appears to be important in both PTAP- and 
PPEY-mediated viral release. This suggests that inhibiting human POSH 
level/activity can be potentially effective against inhibihiting the infectivity of any 
virus containing PTAP or PPEY motifs. HIV (contains PTAP) and Ebola virus 
(contains PPEY) are examples of such viruses. 

10 

6. Effect of RING and SH3 deletions on HIV-1 release 

HeLa cells were sequentially transfected (within 48 hours) with a deletion 
construct of human POSH and pNLenv-1 . The deletion constructs used in this 
experiments were: dRING (RING domain deletion), dSH 3 4 (4 th SH 3 domain 

15 deletion), dSH 3 1 (1 st SH 3 domain deletion) and dSH 3 3(3"* SH 3 domain deletion). 
As controls, Hela cells were also sequentially transfected with mock and pNLenv-1 , 
and mock and pNLenv-1 ATAP respectively. 24 hours post transfection cells were 
subjected to pulse chase studies. Viral proteins were immunoprecipitated from cell 
lysates and pelleted virions, separated by SDS-PAGE and analyzed by fluorography. 

20 Positions of the two major CA products, p24 and p25, are indicated by double 
arrows. The rate of Pr55 processing was estimated by calculating the ratio of CA 
versus Pr55 detected intracellularly at different time point. The results, shown in 
Figure 21, show that RING deletion in hPOSH leads to complete budding inhibition 
compare to ATAP mutant control. SH 3 4 domain deletion effect in the same way as 

25 RING deletion. 

7. In- vitro assay of Human POSH self-ubiquitination 

Recombinant hPOSH was incubated with ATP in the presence of El , E2 and 
ubiquitin as indicated in each lane. Following incubation at 37°C for 30 minutes, 
30 reactions were terminated by addition of SDS-PAGE sample buffer. The samples 
were subsequently resolved on a 10% polyacrylamide gel. The separated samples 
were then transferred to nitrocellulose and subjected to immunoblot analysis with an 
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anti ubiquitin polyclonal antibody. The position of migration of molecular weight 
markers is indicated on the right. 

Poly-Ub: Ub-hPOSHconjugates, detected as high molecular weight adducts only in 
reactions containing El, E2 and ubiquitin. hPOSH-176 and hPOSH-178 are a short 
5 and a longer derivatives (respectively) of bacterially expressed hPOSH; C, control 
E3 

preliminary steps in high-throughput screen 
Objective 

1. Test Ub detection with in a Ub chain as function of an E3 (HRD1) and POSH 
10 auto-Ubiquitination. 

2. Test Boston Biochem reagents. 
Materials 

1 . El recombinant from bacculovirus 

2. E2 Ubch5c from bacteria 

15 3, Ubiquitin . . *... -.. 

4. POSH #178 (1-361) gst fusion-purified but degraded 

5. POSH # 176 (1-269) gst fusion-purified but degraded 

6. hsHRDl soluble ring containing region 

5. Bufferxl2 (Tris 7.6 40 mM, DTT ImM, MgCl 2 SmM, ATP 2uM) 
20 6. Dilution buffer (Tris 7.6 40mM, DTT ImM, ovalbumin lug/ul) 
protocol 





0. lug/ul 


0.5ug/ul 


5ug/ul 


0.4ug/ul 


2.5ug/u/ 


0.8ug/ul 






El 


E2 


ub ; 


176 


178 


Hrdl 


Bxl2 


-El (E2+176) 




0.5 


0.5 


1 






10 








-E2 (El+176) 


1 




0.5 


1 






9.5 


-ub (E1+E2+176) 


1 


0.5 




1 






9.5 








El+E2+176+Ub 


1 


0.5 


0.5 


1 






9 


-El (E2+178) 




0.5 


0.5 




1 




10 








-E2 (El+178) 


1 




0.5 




1 




9.5 


-ub (E1+E2+178) 


1 


0.5 






1 




9.5 


El+E2+178+Ub 


1 


0.5 


0.5 




1 




9 


Hrdl, El+E2+Ub 


1 


0.5 


0.5 






1 


8.5 
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1 . Incubate for 30 minutes at 37°C. 

2. Run 12% SDS PAGE, gel and transfer to nitrocellulose membrane 

3. Incubate with anti-Ubiquitin antibody. 

5 Results, shown in Figure 22, demonstrate that human POSH has 

ubiquitin ligase activity. 

8. Co-immunoprecipitation of hPOSH with tnvc-tagged activated CV12) and 
dominant-negative (N17) Racl 

10 Hela cells were transfected with combinations of myc-Racl V12 or N17 and 

hPOSHdelRING-V5. 24 hours after transfection (efficiency 80% as measured by 
GFP) cells were collected, washed with PBS, and swollen in hypotonic lysis buffer 
(lOmM HEPES pH=7.9, 15mM KC1, O.lmM EDTA, 2mM MgC12, ImM DTT, and 
protease inhibitors). Cells were lysed by 10 strokes with dounce homogenizer and 

15 centrifuged 3000xg for 10 minutes to give supernatant: (Fraction 1) and nucleii. 
Nucleii were washed with Fraction 2 buffer (0.2% NP-40, lOmM HEPES pH=7.9, 
40mM KC1, 5%. glycerol) to remove peripheral proteins. Nucleii. were spun-down 
and supernatant collected (Fraction 2). Nuclear proteins were eluted in Fraction 3 
buffer (20mM HEPES pH=7.9, 0.42M KC1, 25% glycerol, O.lmM EDTA, 2mM 

20 MgC12, ImM DTT) by rotating 30 minutes in cold. Insoluble proteins were spun- 
down 14000xg and solubilized in Fraction 4 buffer (1% Fos-Choline 14, 50mM 
HEPES pH=7.9, 150mM NaCl, 10% glycerol, ImM EDTA, l.SraM MgC12, 2mM 
DTT). Half of the total extract was pre-cleared against Protein A sepharose for 1.5 
hours and used for IP with 1 jig anti-myc (9E10, Roche 1-667-149) and Protein A 

25 sepharose for 2 hours. Immune complexes were washed extensively, and eluted in 
SDS-PAGE sample buffer. Gels were run, and proteins electro-transfeired to 
nitrocellulose for immunoblot as in figure 23. Endogenous POSH and transfected 
hPOSHdelRING-V5 are precipitated as a complex with Myc-Racl V12/N17. 
Results, shown in Figure 23, demonstrate that POSH co-immunoprecipitates with 

30 Racl. 

9 . Knock-down of hPOSH entraps HIV virus particles in intracellular vesicles. 
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HTV virus release was analyzed by electron microscopy following siRNA 
and full-length HTV plasmid (missing the envelope coding region) transfection. 
Mature viruses were secreted by cells transfected with HIV plasmid and non- 
relevant siRNA (control, lower panel). Knockdown of TsglOl protein resulted in a 
5 budding defect, the viruses that were released had an immature phenotype (upper 
panel). Knockdown of hPOSH levels resulted in accumulation of viruses inside the 
cell in intracellular vesicles (middle panel). Results, shown in Figure 24, indicate 
that inhibiting hPOSH entraps HIV virus particles in intracellular vesicles. As 
accumulation of HIV virus particles in the cells accelerate cell death, inhibition of 
10 hPOSH therefore destroys HIV reservoir by killing cells infected with HIV. 

10. POSH is localized at two sites one at the nuclei and one at the golgi (CV) . After 
HIV transfection recruitment of POSH to the golgi is enhanced CD) 

HeLa and 293T cells were transfected with pNenv-L 24 hours post 
1 5 transfection the cells were incubated with primary antibodies against POSH • 
, alongside with either anti-p24 Gag (B) or one of the organelle markers as follows: . . 
BiP- endoplasmatic reticulum (data not shown), GM 1 30- golgi, (C,D) nucleoporin- 
nuclei matrix (E, F) and histone HI- nucleus (data not shown). Two different 
florescence labled secondary antibodies were used. Specific signals were obtained 
20 by laser-scanning confocal microscopy, and individual signals were overlaid to 
assess their relative positions. As shown in figure 7, POSH (A) is localized in the 
nucleus and partially colocalized with HTV-1 Gag following transfection outside the 
nucleus, presumably in die nuclear matrix (B). 

25 1 1 . POSH Ubiquitin Ligase Activity 

Transfection is performed in 10 cm dishes of 293T cells. Cells are plated at 3.7x106 
cells per plate 24 hours prior to transfection in DMEM high glucose with 10% fetal 
calf serum and no antibiotic (complete serum). The day of transfection the medium 
30 is replaced with 5 ml of fresh complete medium, 2 hours prior to the addition of 
transfection cocktail. 
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Transfections: 

1 . Place 3 ml of OptiMEM™ into 1 0 ml snap cap tube. Add 1 20 jd 
lipofectamin 20QOTM (Invitrogene). Incubate for 5 minutes at room temperature. 

2. Meanwhile, place 1 ml OptiMEM™ in 3x10 ml snapcap tubes and the 
following DNAs (pNLenv-1, HIV-encoding plasmid; HA-Ubiquitin, a plasmid 
encoding ubiquitin with an HA tag; Empty plasmid, the vector used to clone POSH 
into it; POSH wild-type, foil length POSH encoding plasmid; POSH C to A, POSH 
plamid with a mutation at the first cystein of the RING doinain to an alanine):- 
Tube no. pNLenv-1 HA-Ubiquitin Expression plasmid 

A 2 /ig 2 fig Empty plasmid - 5 fig 

B 2/xg 2 fig POSH wild type - 5 fig 

c 2fig 2fig POSHCtoA-5/ig 

3. To each tube A, B and C add 1 ml from lipofectamine mix (from step 1). 
Incubate at room temperature for 1 hour. ; 

4. Add 1 ml of the transfection cocktail A (step 3) to plates number 1 and 4; 
transfection cocktail B to plates 2 and 5; transfection cocktail C to. plates 3 and 6. 

5. Incubate for 24 hours. 

6. At the end of incubation wash cells twice with lxPBS and scrape cells in 1 
ml PBS. 

7. Spin at 500xg for 5 minutes to pellet cells. Lyse cells in 1 ml Triton 
solubilization buffer (50 mM HEPES-NaOH, pH 7.4, 1 50 mM NaCl, 5 mM MgC12, 
0.5 mM EDTA, 0.5 mM DTT and protease inhibitor cocktail) for 10 minutes on ice 
and 10 minutes at 300C. 

8. Spin cells to pellet insoluble material at 8000xg for 20 minutes. Transfer 
supernatant to a fresh tube. 

9. Immunoprecipitate for 4 hours supernatants in tubes 1-3 with anti-HA 
antibody; and supernatants form tubes 4-6 with a cocktail of anti-gag antibodies. 

10. Wash beads extensively and add lx SDS sample buffer. Separate extracts on 
12.5% SDS-PAGE, transfer to nitrocellulose and immunoblot with anti-HA or anti- 
reverse transcriptase (RT) antibodies as specified in Figure 26. 
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As shown in Figure 27, cells expressing wild-type POSH show ubiquitination of the 
pi 60 Gag-Pol polypeptide. This ubiquitination is abolished by a Cys to Ala 
mutation in a conserved cysteine of the POSH Ring domain. 

5 11. POSH Protein-protein interactions 

POSH-associated proteins were identified by using a yeast two-hybrid assay. . 
Procedure: 

Bait plasmid (GAL4-BD) was transformed into yeast strain AH 109 
(Clontech) and transformants were selected on defined media lacking tryptophan. 

1 0 Yeast strain Yl 87 containing pre-transformed Hela cDNA prey(G AL4- AD) library 
(Clontech) was mated, according to the Clontech protocol, with a yeast strain 
containing the bait vector, and plated on defined media lacking tryptophan, leucine, 
histidine and containing 5 mM 3-amino-triazol. Colonies that grew on the selective 
media were tested for beta-galactosidase activity and positive clones were further 

15 characterized. Plasmid was recovered from yeast colonies and transformed into E. 
coli DHSalpha strain. After ampicillin selection plasmid was prepared from bacterial 
. colonies and transformed back into yeast strain AH 109 together with bait plasmid or 
empty bait vector and colonies selected on defined media lacking leucine and 
tryptophan and then scored for growth on media lacking tryptophan, leucine, 

20 histidine and containing 5 mM 3-amino-triazol. Only prey clones which their growth 
on this media was dependent on bait plasmid were scored as true hits. Prey clones 
were identified by amplifying cDNA insert and sequencing using vector derived 
primers. 
Bait: 

25 Plasmid vector: pGBK-T7 (Clontech) 

Plasmid name: pPL269- pGBK-T7 GAM POSHdR 

Protein sequence: Corresponds to aa 53-888 of POSH (RING domain deleted) 

RTLVGSGVEELPSNILLVRIjLDGIKQRPWKTCPGGGSGTNCTNAIJISQSSWANCSSKDL 
QSSQGGQQPRVQSWSPPVRGI PQX.PCAKALYNYEGKE PGDLKFSKGDI I ILRRQVDENWY 
30 HGEVNGI HGFFPTNFVQI I KPLPQPPPQCKALYDFEVKDKBADKDCI»PFAKDDVI»TVIRR 
VDENWAEGMLADKI GI FPI S YVEFNSAAKQLI EWDKPPVPGVDAGECS SAAAQSSTAPKH 
SDTKKNTKKRHS FTSLTMANKSSQASQNRHSME I S pPVLI S S SNPTAAARI S ELSGLSCS 
APSQVHISTTGLIWPPPSSPVTTCPSFTFPSDVPYQAALGTLNPPLPPPPLLAATVLAS 
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TP PGATAAAAAA6MGPRPMAGSTDQI AHLRPQTRPSVYVAI YPYTPRKEDELELRKGEMF 
LVFERCQDGWFKGTSMHTSKIGVPPGNYVAPOTRAVTNASQAKVPMSTAGQTSRGVTM^ 
PSTAGGPAQKLQGNGVAGS PS WPAAWSAAHI QTS PQAKVLIjHMTGQMTVNQARNAVRT 
VAAHNQERPTAAVTPIQVQNAAGLS PASVGLSHHSLAS PQPAPLMPGS ATHTAAI S I SRA 
5 SAPIiA(3tf*AAPLTSPSITSASLEAEPSGRIVTVIiP^ 

KKEKKGLLKLIiSGASTKRKPRVS PPAS PTLEVELGSAELPLQGAVGPELPPGGGHGRAGS 
CPVDGDGPVTTAVAGAAIiAQDAFHRKASSIiDSAVPIAPPPRQACSSIiGPVLNESRPVVCE 
RHRVVVSYPPQSEAELEIjKEGDIVFVHKKREDGWFKGTLQRNGKTGLFPGSFVENI 

Library screened: Hela pretransfonned library (Clontech). 
10 Three POSH-APs were identified: UNC48B (Hs.5898 unc-84 homolog A), 

MSTP028 (Hs. 302746) and HERPUD1 (Hs.146393). Examples of each are shown 
below. 

Human UNC84B mRNA sequence - varl (public gi: 21265160) 

GGC^CGAGGGCAAGTCGGCCAGGGAAGCCGCGGCCTCCCTGAGCCTGACGCT 
1 5 TGGAGTGACAGAGGAGCAGGTGCACCACATCGTGAAGCAGKSC^ 

GGGCTGGCAGACTACGCCCTGGAGTCAGGAGGGGCCAGCGTCATCAGC^ 

AGACCAAGACXXXTCCTCCTCAGCCTC^CGGCATCCCC 

CCTCCAGCCAGATGTGCACCCAGGCAACTGCTGGGCCTTCCAGGGGCCACA 

CTCTCTGCCCX3CATCCGCCCCACAGCCGTTACCTTAGAGCATGTGCCCAAGG 
20 CTATCTCCAGTGCCCCCAAGGACrTCGCCATCrTTGGC^ 

CCTTGGCAAGOTCACTTACGATC^GGACGGCGAGCCTATTCAGATO 

GCCACGTACCAGGTGGTGGAGCTGCGGATCCTGACTAACTCGGG 

GCTTCAGAGTGCATGGGGAGCCCGCCCACTAGCCCTGC^ 

GGGTGAACAG(^CCCX!GCC^CTTCCCC(^C7VCGCTTGCTCGGCGCrC^ 
25 GAGCCTGTGGCCCCATGCAGATGAAAAGGACGGGCAGGGTC 

AGCAGGCTCCAGCAGCTOCCTTCTTCCTTCCCTCTGTGCCCGTGGCGTCTGCTO 

GTATATATGTAGCATATCATGGGGGACTGK3GAAGOT 

TCAGGGGCTGGTGCCTGGGAGCTGATGAAG CAGGTGCCAGGGCTGTGGGAGG GG CAAGCT ACGGCCTGGG 

CTAGGTGAGCTGCCTCTGCCCCTGGGCAAGGAAGCGAGGCCCTCTGGGAGGAGGGTGCTTAGCTCCAGAG 
30 CAGGATGGGAOTTCCCCAGGCAGGAAGCACTTGATGGAGA 

CTCCACCDVGGGAAGCCTGAACCACAGGGTCCCTGAG^ 

AGGGTAGCAGTGGGCCATGGGGCTTCTTGTGCCCTAAAGGGGACTGGOT 

CAGGGCCAACCCTGTAGGCTTCCCCTCTGCTGGGGAC^ 

GGGGCCCACCCTGCTCCCTGTTCCTGCTAGGGCCTGCXI^^ 
3 5 AGGGTATGGAGACCTAGACCTGTCTTTGGGGCCATTAGCATC^ 

ATGGAACTC CTGGGCTCTTGTGGGGACGTTCAGGGTATCGGGGTG CGAGGTCTGTCTGCACCGGCCCCCA 

CATCTAACCAGGCCC^GATGTAGGGGTCGTCCGCTCAGGCTGCCCCCTTGGGCTCTTGCAGCTCTTGTTC 

AGGTAGTCX3CCCTTCTGGTTTGTTCTCTGTGGGGCAGTTGG 

TTACCCTGGATAGGGAAGGGGGAGGAGGGGACTTTTAGAGC 
40 TTTTCAAGGTCTGTTTTTCTAACTGAAAAGCTAAGGGCTTGATT 

GTGAOACTCAGTTrCTTGTTCCTTC 

TCCGGCTGCAGGGAAGGGGCAAGAAGGCGGGCAGGCCTTCT 
GAGCATGAGGCTGGATGCAGTGGTGGTGAGGCCGCCCCGCTCCATCCCG^ 
CGCTCTCCTGTCACAAATGCTGCACTAT^ 
45 TGCAAAAAATAAATGACGCCCAAGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

Human UNC84B mKNA sequence - var2 (public gi: 6538748) 

TCCCXKGTACACTCTCTGGAGCGGCGTCTGGAAGCTCTTGCTGCTGAATTTTCCT 
AGGCCATGCGGC1GGAACGTCTGGAGCTGCGGCAAGGGGCTCCTGGCX!AGGGAGGTGGT 
50 CCACGAGGACACCCTGGOTCTGCTGGAGGGGCTAGTGAGCCGCCGTGAAG 

CGCAGGGAAACTG CTGCTCX3 CATCCAGGAAGAACTGTCTGCCCTGAGAG CAGAGCAT CAGCAAGACTCAG 
AAGACCTCTTCAAGAAGATCGTCCGGGCCTCCCAGGAGTCCGAGGCrCGCA 

GTGGCAAAGCATGACCGAGGAGTC CTTCCAGGAGAGCTCTGTGAAGGAGCTGAGG CGGCTGGAGGACCAG 
CTGGCCGGCCTGC^GCAGGAGCTGGOGGCTCTGGCACTGAAGCAGAGCTCGGTGGC^ 
55 TGCTGCCCCAGCAGATCCAGGCCXSTGCGGGAOGACGTGGAATCTCAG^ 
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CCTTGCCCGAGGTGGAGGGGGC CGCGTGGGGCT CCTTCAGAGAGAGGAGATGCAAGCT CAGCTGCGAGAG 
CTGGAGAGCAAGATCCTCACCCATGTGGCAGAGATGCAGGGC3VAGTC 

TGAGCCTGACX3CTGCAGAAAGAAGGTGTGATTGGAGTGACAGAGGAGCAGGTGCACCACAT 
GGCCCTGCAGCGCTACAGTGAGGACCGCATCGGGCTGGCAGACTACGCCCTGGAGTCAGGAGGGGCCAGC 
5 GTCATCAGCACCCGATGTTCTGAGACCTACGAGACCAAGACGGCCCTCCTCAGCCTCTTCGGCATCCCCC 
TGTGGTACCACTCCCAGTCACCCCGAGTC&TCCTC^ 
CCAGGGGCCACAGGGCTTCGCCGTGGTCCGCCTCT^ 
CATGTGCCCAAGGCCTTGTCACCCAACAGCACrATCTCCAGTGCrc 
TTGLACGAAGACCTGCAGCAGGAGGGGACACTCCTTGGCAAGTTCA 
1 0 T.CAGACGTTTCACTTTCAGGCCCCTACX^T 

TGGGGCCACCCCGAGTACACCTGCATCTACCGCTTCAGAGTGCATGGGGAGCCCGCCCACTAG 

Human UNC84B mRNA sequence - var3 (public gi: 2056i57i) 

ACTCGCGCAGCGO^CCTCCGCCGGCCGCATTGTCCGCCCGCCCGCCTTG 

1 5 GCGTGCCCXXX3CGCCCCGGGCCCXMCCGCTGTGTa5CCCTGAGCGGAGCGCCGCCCGCGGGATCCCCACG 
CGGAAAGGGGGCGCGCCCCGGCGGCGGCCTGGCCTCGGACGCCCCCGGCCGGGCTAGAAGCCGCCGCGGC 
AGCAGATTCTCTTCAGGGGAAGAGTCCACATCCCACCTCATCATGTCCCGAAGAAGCCAGTO 
GQTACTCCCAGGGTGACX3ATGACGGCAGCAGCAGCAG CGGAGGGAG CT CGGTGG CTGGGAGTC7VGAGCAC 
CCTGTTTAAAGACAGTCCTCTCAGGACCTTGAAGAGGAAATCCAGCAAC 

20 CCAC^GCTGGGCCCGTCCTCTGATGCACACACCTCCTACTACAGTGAGTCGCTC 

TCCC^CCCAGGAGCTCCCTGGAGGJVACTGCATGGTGACGCCAACTGGGGTGAGGACCT 

GAGG AGAGGCAC GGGTGGCT CAG AGAGCAG CAGGGCCAGCGGGCTTGTGGGG CG CAAGGCCACCGAGGAC 

TTCCTGGGCTCTTCCTCGGGCTACTCCTCTGAGGACGACTACGTGGGCTACTCGGATGTGGACCAGCAGA 

GTTCCAGCTOTCGGCTCCGAAGCGCCGTCTCACGGGCGGGCTCCTTACTCTGGATGG 

25 AGGCCGGCTCTTCAGACTTCTCTACTGGTGGGCTGGCAC^ 



TGCTCTTGCTGACGTGCCTGACGTATGGTGCTTGG 

TCGCC^CATTTCCAGGCTGAGCAGCGTGTTATGTCCCGGGTACACTCTCTGGAGCGGCGT 
30 TTGCTGCTGAATTTTCCTCCAACTGGC^GAAGGAGGCCAT 

GGCTCCTGGCCAGGGAGGTGGTGGTGGCCTGAGCCACGAGGACACCCTGGro 

AGCCGCCGTGAAGCTGCCCTGAAGGAGGATTTCCGC^GGGAAACTGCTG 

CTGCCCTGAGAGCAGAGCATCAGCAAGACTCAGAAGACCTCT 

GTCCGAGGCTCGCATCCAGCAGCTGAAGTC^GAGTGGCAAAGCATGACCCAGGAGTCCTTCC^ 
3 5 TCTGTGAAGGAGCTGAGGCGGCTGGAGGACC^GCTGGCC^^ 

TGAAGCAGAGCTCGGTGGCGGAAGAAGTGGGCCTXH^CCC^ 

GGAATCTCAGTTCCCGGCCTGGATCAGTCAGTTC CTTGCCCGAGGTGGAGGGGG CCGCGTGGGGCTCCTT 
CAGAGAGAGGAGATGCAAGCTCAGCTGCGAGAGCTGGAGAGCAAGATC 

AGGGCAAGTCGGCCAGGGAAGCCGCGGCCTCCCTGAGCCTGACGCTGCAGAAAGAAGGTGTGATTGGAGT 
. 40 GAmGAGGAGCAGGTGCACCACATCGTGAAGCAGGCCCTGCAG^^ 

G CAGACTACG CCCTGGAGTCAGGAGGGGCCAGCGTCATCAGCACCCGATGTTCT GAGACCTACGAGACCA 
AGACGGCCCTCCTCAGCCTCTTCGGCATCCCCCTGTGGTACCACTCCC^ 
GCCAGATGTGCACCCAGGCAACTGCTGGGCCTTCCAGGGGCCACAA 
GCCCGCATCCXK!CCCACAGCCGTTACX^rTAGAGCA^ 
45 CCAGTGCCCCCAAGGACnrrCGCCATCTTTGGGTT^ 

CAAGTTCACTTACGAT^GGACGGCGAGCCTATTC^GACGTTTCACTTTCA 

TACCAGGTGGTGGAGCTGCGGATCCTGACTAACTGG 

GAGTGCATGGGGAGCCCGCCCACTAGCCCrGCTTACroGTC 

50 GTGGCCC<^TGCAGATGAAAAGGACGGGCAGGGTCTCCTGAGCAG 
CTCCAGCAGCTCCCTTCTTCCTTCCCTCTOT^ 

ATGTAG CATAT CATGGGGGAC TGGGAAGTTGGGAGAGGTAGGACCTGACTGGTCTTGGCTGGGGTCAGGG 
GCTGGTGCCTXMGAGCTGATGAAGCAGGTGCCAGGGCTC 

GAGCTGCCTCTGCCCCTGGG CAAGGAAGCGAGG CCCTCTGGGAGGAGGGTGCTTAGCTCCAGAGCAGGAT 
55 GGGACTTCCCCAGG(^GGAAGCACTTGATGGAGAGCTG CCCAGCTCTCCTACAA GGTTAGTGCCCTCCAC 

C TAGGGAAGCATGAACCACAGGGTCCCTGAGGG CCTTCXSACAAAAGTGTGTATTTGTCCCGGGGAGGGTA 

GCAGTGGGCCATGGGGCTTCTTGTGCCCTAAAGGGGACTGGCTGCTGTGAT 

CAACCCTGTAGGCTTCCCCTCTGCTGGGGAGGGTAGTTGCTTTTCTCTCTCCTGATG 

CACCCTGCTCCCTGTTCCTGCTAGGGCCTGCCAGTGCCCCTGAGCT^ 
60 TGGAGACCT AGACCTGTCTTTGGGG CCATTAGCATCTGGGGTTATAG CAAGAAG AGTGGGGAGCATGGAA 

CTCCTGGGCTCTTGTGGGGACGTTCAGGGTATCGGGGTGCGAGGTCTGTCTC 

ACCAGGCCCTGATGTAGGGGT CGTCCGCTCAGGCTG CCCCCrrTGGGCTCrTGCAGCX , C"rr(J'rrCAGGTAG 
TCGCCCTTCTGGTTTGTTCTCTGTGGGGC^VGTT 

TGGATAGGGAAGGGGGAGGAGGGGACTTTTAGAGCCAGCAGGCCCCACTGTATTATGTA 
65 AGGTCTGTTTTTCTAACTGAAMGCTAAGGGCTTGAOT 
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CTCAGTTTCTTGTTCCTGGCCGTGGAGAGGGGCCTGGGGCACTGGTTCCGGCTGTGTCrcG 
TGCGGGGAAGGGGCAAGAAGGCGGGCAGGCCTTCACTGC^GC^C 
GAGGCTGGATGCAGTGGTGGTGAGGCCGCCCCGCrrCC^ 
CCTGTCACAAATGCTGCACTATTGGTT^ 
5 AAATAAATGACGCCCAAGA 

Human UNC84B mRNA sequence - var4 (public gi: 3327149) 

CTCAT CATGTCC CGAAG AAG CCAGCGCCT CACG CGCTACTC CCAGGGTGACGATGACGGCAGCAGCAGCA 
1 0 GCGGAGGGAGCTCGGTGGCTGGGAGT(^GAGC^CCCTGTTTAAAGACAGTCCTCTCAGGACCTTGAAGAG 
GAAATCC^GCAACATGAAGCGCCTGTCCCCAGCGCCACAGCTGGGCCCGTCCTCTGATGCACACACCTCC 
TACTACAGTGAGTCGCTGGTCCACGAGTCCTGGTTCCCACCCAGGA.GCTCCCTGGAGGAACTGCATGGTG 
ACGCCAACTGGGGTGAGGACCTGCGGGTGCGGAGGAGGAGAGGCACGGGTGGCTC^ 
CAGCGGGCTTGTGGGGCGCAAGGCCACCGAGGACTTCCTGGGCTCTTC 
15 GACTACGTGGGCTACTCGGATGTGGACC^GC^GAGTTCCAGCTCGCGGCTCCGAAGCGCCGTCTCACXS^ 
CGGGCTCCTTACTCTGGATGGTGGCC^CTTCGCCAGGCCGGCTCTTCAGACTTCTCTACTGGTGGGCTGG 
CACCACCTGGTACCGCCTGACCACAGCTGCCrCCCTCCTTGACGTCTTCGTTTTAACCAGGCGCTTCTCG 
TCCCTGAAGACGTTCCTCTGGTTCCTGCTGCCXSCrGCTCTTGCTGACGTGCCTGACGTATGGTGCTTGGT 
ATTTCTACCCCTATGGGCTGCAGACATTCCACCCTGCTTTGGTTTC 
20 GAGGCCGGATGAGGGCTTGGGAAGCCAGAGACTCATC 

CGGGTACACTCrCTGGAGCGGCGTCTGGAAGCTCrrGCTGCTGAATTTTCCTCCAACTC 

CGAGGACACCCTGGCGCTGCTGGAGGGGCTAGTGAGCCGCCGTGAAGCTGCCCTGAAGGAGGATTTCCGC 
AGGGAAACTGCTGCTCGCATCCAGGAAGAACTGTCTGCCCTGAGAGCAGAGCA 

25 ACCTCTTCAAGAAGATCGTCCGGGCCTC 

GC^AAGCATGACCCAGGAGTCeTTCCAGGAGAGCTCTGTGAAGGAGCTGAGGCGGCrr 

TGCCC^GCAGATCCAGGCCGTGCGGGACGACGTGGAATCTCAGTTCCCGGCCTGGATCAGTCAGTTCCT 

TGCX:CGAGGTGGAGGGGGCCGCGTGGGGCTCCTT(^GAGAGAGGAGATGCMGCT * 
30 GAGAGCAAGATCCTCACCCATGTGGCAGAGATGCAGGGCAAGTCGGCCAGOT 

GCCTGACX3CTGCAGAAAGAAGGTGTGATTGGAGTGACAGAGGAGCAGGTGCACCA 

CCTGCAGCGCTACAGTGAGGACCGCATCGGGCTGGCAGACTACGCCCTGGAGTC^ 

ATCTVGCACCCGATGTTCTGAGACCTACGAGACCAAG^^ 

(MTACCACTCCCAGTCACCCCGAGTCATCCTCCAGCCAGAX 
3 5 GGGGCCA(^GGGCTTCGCCGTGGTC03CGrCrCTGCCCGC^TCCGCCCCACAGCCGT 

GTGCCXIAAGGCCTTGTCACCCTU^CAGCACT 

ACGtf^GACCTGCAGCAGGAGGGGAC^CTCCTTGG 

GACGTTTCACTTTCAGGCCCCTACGATGGCCACGTACCAGGTGGTGGAGCTGCGGA 
GGCCACCCCGAGTAC^CCTGCATCTACCGCTT^ 
40 TGGTGCCTGCTOCCAGCCATCTGGGAGTGGGTGAACAGCACCTC 

CGCTCTGACTTCTAGGAG CACAAGAGAGGAGCCTGTGG CCCCATGCAGATGAAAAGGACGGG CAGGGTCT 
CCTGAGCAGCAGGTGGCTCGAGGCGGTTAGCAGGCT 

TGGCGTCTGCTTCC CATCCTGGGAGTGTGTATATATGTAG CATATCATGGGGGACTGGGAAGTTGGGAGA 

GGTAGGACCTGACTGGTCTTGGCTGGGGTCAGGGGCTGGTGCCTGGGAGCTGAT 
45 CTGTGGGAGGGGCAAGCTACGGCCTGGGCTAGGTGAGCTGC 

TCTGGGAGGAGGGTGCTTAGCTCC^GAGCAGGATGGGACTTCCCCAGGCA 

TGCCCAGCTCTCCTACAAGGTTAGTGCCCTCCACCT 

TCGACAAAAGTGTGTATTTGTCCCGGGGAGGGTAGCAGTGGGCCA 

ACTGGCTGCTGTGATCTTCTAAGGGGCCCAGGGCCAACCCTGTAGGCTTCCCCTCTG 
50 TTGCTTTTCTCTCTCCTGATGCTAGGTTGGGGCC^ 

CCCCTGAGCTTGCTTTCCAGATTCTCCCAGGGTATGGAGACCTAGACCTG 

GTGCGAGGTCTGTCTGCACCGGCCCCCACATCTAACCAGGCCCTGATGTAGGGGTCGTCCGCT 

55 GGGGGCTGGGGGAAGAGGCTGGCAGAAGTTACCCTGGATAGGGAAGGGGGAG GAGGGGACTTTTAGAG CC 

AGCAGGCCCCACTGTATTATGTATATATTTTTCAAGGTCTC 

TTCCTAGCCCCOTTCTGTGGGGCACTGG 

GGGCACTGGTTCCGGCTGTGTCTGGTGGTCCGGCT 

TGCAGCACTGAGCCTCAAATCCGCTCTGGAGCATGAGGCTGGATGCA 
60 CATCC CGAGGCAGC CAGGCTTTGTTTTGCG CTCTCCTGTCACAAATGCT 

TTTATCTCCAGATCCTAATTTATGCCTATGCAAAAAATAAATGACGCCCAAGAGCTG 

Human UNC84B Protein sequence - varl (public gi: 21265161) 

HEGKSAREAAASLS LTLQKEGVI GVTEEQVHHIVKQALQRYSEDRI GLADYALESGGASVI STRCSET YE 
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TKTALItS LFGI PLW YHS QS PRVILQPDVHPGNCWAFQGPQGFAWRLS ARI RPTAVTLEHVPKALSFNST 
I SSAPKDFAX FGFDEDLQQEGTLIiGKFT YDQDGEPIQTFHFQAPTMATYQVVEXiRILTNWGHPEYTCI YR 
FRVHGEPAH 

5 Human UNC84B Protein sequence - var2 (public gi: 6538749) 

SRVHSLERRLEAIAAEFSSNWQKEAMRLERLELRQGAPGQGGGGGLSH^^ 

RRETAARI QEELSALRAEHQQDS EDLFKKXVRAS QESEARI QQLKS EWQSMTQE S FQE S S VKELRRLEDQ 
lAGLQQEIAALALKQSSVAEEVGIiLPQQIQAVRDDVESQFPAWISQFIiARGGGGRVGLLQREEM 
LESKIXjTHVAEMQGKSAREAAASLSLTIjQKEGVIGVTEEQVHHIVKQALQRYSED 
1 0 VISTRCSETYETKTALLSLFGI PLWYHSQSPRVILQPDVHPGNCWA FQGPQGFAWRIi SARI RPTAVTLE 
HVPKALSPNST I SSAPKDFAI FGFDEDLQQEGTLLGKFTYDQDGEP I QTFHFQAPTMATYQVVELRILTN 
WGHPEYTCIYRFRVHGEPAH 

Human UNC84B Protein sequence - var3 (public gi: 14778953) 

1 5 MSRRSQRLTRYSQGDDDGS S SSGGS S VAGSQSTLFKDS PLRTLKRKSSNMKRLS PAPQLGPS SDAHTS YY 
SESIjVHESWFPPRSSLEELHGDANWGEDIiRVRRRRGTGGSES SRASGLVGRKATEDFLGS S SGYSSEDDY 
VGYSDVDQQSSS SRLRS AVSRAGSLLWMVATS PGRL FRI*L YWWAGTTWYRLTTAASIiI»DVFVLTRRFS SL 
KTFLWFLLPIiLLLTCLT YGAWYF YP YGM^FHPALVSWWAAKDS RRPDEGWEARDSS PHFQAEQRV^ RV 
HSLERRLEALAAEFSSNWQKEAMRLERLELRQGAPGQGGGGGLSHEDTLAIiL 

20 TAARIQEELSALRAEHQQDSEDLFKKIVRASQESEARIQQLKSEWQSMTQESFQESSVKEIiRRIiEDQIiAG 
LQQELAALALKQS SVAEEVGLLPQQI QAVRDDVESQFPAW I SQFLARGGGGRVGLLQREEMQAQLRELES 
KI LTHVAEMQ G KS A REAAAS LS LTLQKEGV I GVTEEQVHHI VKQ ALQR YS EDRI GLAD YALE £ GGAS V I S 
TRCS ETYETKTALLSLFGI PLW YHSQS PRVILQPDVHPGNCWAFQGPQGFAWRL SARIRPTAVTLEHVP 
KALS PNSTI SSAPKDFAI FGFDEDLQQEGTLLGKFT YDQDGEPI QTFHFQAPTMAT YQWELRI LTNWGH 

25 PEYTCIYRFRVHGEPAH 

Human UNC84B Protein sequence - var4 (public gi: 22096I84) 

LRGVPVWAAGAFRFSSGEESTSHLIMSRRSQRLTRYSQGDDDGSSSSGGSSVAGSQSTLFKDSPLRTLKR 
KSSNMKRLS PAPQLGPS SDAHTS YYSESLVHESWFP PRSSLEELHGDANWGEDIjRVRRRRGTGGSESSRA 
30 SGLVGRKATEDFLGS S SGYSSEDDYVGYSDVDQQSS SSRLRSAVSRAGSLLWMV ATSPGRLFRLL YWWAG 
TTVTYRLTTAASLLDVFVLTRRFSSLKTFLWFIjLPLLLLTC3jTYGAVrYFYP 

RPDEGWEARDSSPHFQAEQRVMSRVHSLERRLEALAAEFSSNWQKEAMRLERLELRQGAPGQGGGGGLSH 
BDTLALLEGLVSRREAALKEDFRRETAARI QEELS ALRAEHQQD S EDL FKKI V£AS QES EAR IQQLKS EW 
QSOTQESFQEJSSVKELRRLEDQIiAGLQQELAALALKQSSVJ^ 
35 ARGGGGRVGLLQREEWQAQLRELESKILTHVAEMQGKSAREAAAS LS LTLQKEGVIGVTEEQVHHI VKQA 
MRYSEDRIGLADYALESGGASVISTRCSETYETKTAIjLSLFGIPLWYHSQSPRVILQPDVHPGNCWAFQ 
GPQGFAWRLSA RI RPTAVTLEHVPKALSPNSTI SSAPKDFAI FGFDEDLQQEGTLIiGKFTYDQDGEPI Q 
TFHFQAPTMATYQWELRILTNWGHPEYTCIYRFRVHGEPAH 

40 

Rat UNC84B mRNA sequence (public gi: 27662735) 

ATCCTCCAGGCATCAGAGAGCAGCAACCTCTTCACTGAAG 

C^CATTCCTGGraCCAATCCTGGTTCTAGGGCTTGGAGCCCCA 

TAAGCCCTGGGGCACTGGTAATACCCAGTTTCGCATTCACAT 

TCTCCGGCACCTCTAGC^TCTATOTCTTGCTGTGCAGATC 

CATGTCAAGACXSAAGCCAGCGCCrCACGCGCTACrrCCCAGGATGACAA 

G CAAGCTCCGTGGC7VGGAGG CCAGAGCACCGTGTTTAAAGACAGTCCTCTC^ 

CCAGCAACATGAAG CGCCTGTCCCCAGCTCCGCAGCTGCCCCCC CCCTCTGACTCCCACACCTCCTACTA 
50 CAGCGAGTCTGTGGTTCGGGAGTCCTACATAGGCAGCCCCQ 

CTGGATGACCACCTAi^CAGTGAGCCCrrACTGGAGTGGGGACCr 

GCGGCTCTGAGAGCAGCAAAGCCAATGGGCTC^ 

TTCXTTCAGGCTACTCTTCTGAGGATGACCTTGCAGGCT 

GGGTTAAGAAGTGCAGCATCTCGGGCCGGCTCCTTTGTCTGGAC^ 
5 5 TTGGTCTTCTCTACTGGTGGGTTGGCACCACCTGGTACCGTCTGACAAC^ 

CTTCGTCCTAACCAGGTCCAGGC^CTTCTCACCAAACCTGAAGAGTTTTCT 

CTACTCCTGACTGGTCTGACCTACGGGCTCCAGACACT 

AGGAGAGTAGGAGGCAGCCGGAGGTGTGGGACACCAGGGATGCCTCCTCGCACTTGCAG^ 
CATTCTCTCCCGGGTTCACTCTCTGGAGCGGCGCCTGGAAGC^ 
60 CAGAAGGAGGCCATACX3GCTGGAGCGCTTGGAGCnX3^ 

GCCTGAG CCATGAAGATGCCCTGTcr L"rCXrrAGAAGGGTTGGTGAGCCGCCGGGAAGCTGCC CTGAAGGA 
GGACTTG CGCAGGGACACAGTGGCT CK^ATCCAGGAAGAGCTGGCT ACCCTGAGGGCAGAG CATCACCAG 
GACTCGGAAGACCTATT CAAGAAGATCGTCCAGGCC TCTCAGGAGT CTGAGG CCCGTGTCCAGCAGCTGA 
AGACAGAATGGCGAAGCATGGCCTTACCTTGCTTCCCTCCATCTGGAAATAGCATGACCCAGG^ 
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CCAGGAGAGCTCTGTGAAGGAGCTGGAGO^CTGGAAGCCCAACT 

GCCCTAACTCTGAAGCAGAACTCGGTGGCAGATGAAGTGGGCCTC 

GKX3CTGATGTGGAATCCCAGTTCCCTGACTGGATCAGTCAG 

CGGGCTCCTGCAGAGAGACGAGATGCATGCTCAGCTGCAGGAGCTGGAGA^ 

GCTGAAATGCAGGGCT^GTCAGCCAGGGAGGCCGCAGCTTCTCTGGGACAGACACT 

TAGTTGGGGTGACAGAGGAGCAAGTGCACCGGATTOTCAAGCAGGCC 

TATTGX3AATGGTGGATTATGCCCTGGAATGAGGA 

TACGAGACGAAGACAGCCCTTCTCAGCCTCTTTGGCATCCCCI^GTGG 

TCATTCTGCAGCCAGATGTGCACCCAGGCAACTC 

CCGCCTCTCTGCTCGTATCCGCCCC7VCAGCCGT 

AGCACAATCTCC&GTGCTCCCAAGGACTTCGCCATCT 

CACTTTTGGGCACGTTTGCCTAC^CCAGGATGGGGA 

GATGGCCACATACCAAGTTGTGGAGCTTCXJGATCCTGAC^ 

TACCGCTTCCGGGTGCACGGAGAGCCTGCCCACTAG 

Rat UNC84B Protein sequence (public gi: 27662736) 

MLQASESSNLFTEALTCTLNLMGSTFIAPILVL 

GPVVDGVCVLVCMjMHMHTLAHPLSGTSSIYVLLCRSGGEESPLCIjIM 
ASSVAGGQSTVFKDSPLRTLKRKSSNMKRLSPAPQLPPPSDSHTSYYSESW 

LDDHLHSEP YWS GDLRGRRRRGTGGSES SKANGLTMENKAS EDF FGS S S GYS SEDDIiAGYTOSDQHS SGS 
GLRSAASRAGSFVWTLVTLPGRLFGLLYWWVGTTWYRL 

IjIjLTGLTYGIjQTLQPAVASWWAAKESRRQPEVWDTRDASSHLQAEQRIL srvhslerrleaiaaefs snw 
QKEAIRLERLELRQGAAGHGGGSSLSHEDALSLLEGLVSRREAALKEDLRRDTVARIQEEIATLRAEHHQ 
DSEDIiF KKE VQASQES EARVQQLKTEWRSMALP C FPPSGNSMTQEAFQESSVKEIjERIjEAQIiAGLRQEIiA 
ALTIiKQNS VADEVGLLPQKI QAARADVESQ FPDW I S QFLLRDRGARS GIiLQRDEMHAQLQELENKI LANM 
AEMQGKS AREAAA S LGQTLQ KEGI VGVTEE QVHRIVKQALQ RY S EDRI GMVD YALES GGASVI STRCS ET 
YETICTAIiLSLFGIPLWYHSQSPRVILQPDVHPGNCWAFQGPQGFAVVRIiSARIRPTAWXiEHVPKAL 
STISSAPKDFAIFGFDEDLQQEGTLIiGTFAYDQDGEPIQTFYFQASKMATYQVVELRILT^GH^ 
YRFRVHGEPAH 



Mouse UNC84B mRNA sequence (public gi: 25070155) . 

TCCAGAGACGCCTCCCAGCACTTCCAGGCTGAGCAGCGCG 
GTCTGGAAGCCCTTGCTGCAGACTTTTCCTCCAACTGGCAGA 

GOTGCGGCAGGGGGCTGCTGGCCATGGAGGAGGCAGTAGCCTGAGCCATGAAGATGCCCTGTCTCTCCT 

GAAGGGTTGGTGAGCCGCCGCGAGGCTACCCTGAAGGAGGACTTC 

AGGAAGAATTGGCTACCCTGAGGGCAGAGCATC^CCAAGACTCGGAAGATCTCTT(^^ 

GGCCTCTCAGGAGTCCGAAGCCCGAGTCCAGCAGCTGAAGACAGAATGGAAAAGC^ 

TTCCAGGAGAGCTCTGTGAAGGAGCTGGGACGGCTGGAAGC^ 

CTGCCCTGACTCTGAAGCAGAACTCX^TGGCAGATGAAGTGGGCCTGCTGCCACA^ 

CAGGGCTGATGTGGAATCCCAGTT(X!CTGACT^ 

AGCGGGCTCCTGCAGAGAGATGAGATGCACGCTCAGCTGCAGGAGCTGGAGA^ 

TGGCTGAGATGCAGGGCAAGTCAGCCAGGGAGGCCGCAGCGTCCCTGGGACAGATO 

CATAGTTGG GGTGACAGAGGAGCAGGTGCACCG GATCGTCAAGCAGG CCCTG CAGCGCTACAGTG AGGAC 

AGGATTGGAATGGTGGATTACGCCCTGGAATCAGGAGGAGCCAGTGTTA^ 

CTTATCAGACC^GACGGCACTCCTCAGCCTCTTTGGCATCCCCCTCT 

GGTCATTCTGCAGCCAGATGTGCACCCAGGCAACTGCTGGGCC^ 

GTCCGCCTCTCTGCTCGAAT CCGACCTACAGCCGTTACCTT AGAGCACGTG C CCAAGGCCTTGTCACCCA 
ACAGCACTATCTCCAGTGCTCCCAAGGACTTCGCCATCr^ 

GACACTT CTTGGCACGTTTGCCTACGAC CAGGATGGGGAGCCCATC CAGACCTTCTATTTCCAGG CCTCT 
AAGATGGCCACATACCAAGTTGTGGAGCTTCGGATCCTGACCAACT 
TCTACCX3CTTCCGGGTG»CGGAGAGCCTGCCCACTAGT^ 
GTGGGAAGCTCATCTTTCCCAGCATCTACCCT 

CACATGCTTCTTAG CGCTGACTTCCAGGAGCAAGAGTGAAGAOT CAGCACGT AGAACTCCCTGCATAAG C 

CAAGCCACCAGGCT CCTATGCCTCT CTTAGTCTTCCCCCTAGAGCT AGGCGCCTACCAGGTG XATATATG 

TAGCATACTTTGGGGGACTGCGATCAGAAGTGAGGTGGAGGGACTGGCACCGGTTAA 

CCAAAGCCATGGGCATACau^GCCAGAGCCCCAGGGATAAGCTGGGGGGTCTTGTG^ 

GCTTAGC^CTGTGCAGCTGGGACTGTCCCTGGCAAGTGCCTGGTAAAGGAC^ 

TCCAGCATGGAGGGTACCATGGCCCXn'AGGAGCCTGGGGTTCCTAGCAA 

TGTGCCTCTGACGACTGTTCTCTGTGCCAACCTAAGAGATTCCCAGTGC^ 

TGATGTGGGGTGAGGGTCTATTCTGTTTCC 

Mouse UNC84B Protein sequence (public gi: 20902829) 
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MTQEAFQESSVKELGRIjEAQLASIiRQELAAItTLKQNSVADEV^ 

DRGARSGLLQRDEMHAQLQEIJ3NKI LTTO1AEMQGKSAREAAASLGQ IliQKEGI VGVTEEQVHRI VKQALQ 
RYS EDRI GMVDYAIiESGGASVI STRCSETYETKTALIjSLFGI PLWYHSQSPRVILQPDVH PGNCWAFQGP 
QGFAVVRLSARIRPTAVTIjEHVPKMiSPNSTISSAPKDFAIFGFDEDLQQEGTLLGTFAYDQDGE 
5 YFQASKMATYQWELRI IiTNWGHPEYTCI YRFRVHGEPAH 



Drosophila UNC84B Protein sequence (public gi: 7302310) 

MEVPTVRSPQREAEAI KVNMAS I EQNIQKALTAEEYEN ILNHVNS YVQQLVELKMQQHSKELAPQQIELF 

1 0 VKLMKENLKQIMYKTELSEKDLSDLAJCKLKLELQSS 

QLDR IDFASIiLERJ IiAAPALADFVDARI SLRVGELEPKES SGSSBAEVQI ERLNRET AFI KLAIiSDKQAE 
NADLHQS I SNLKLGQEDLLERI QQHBLSQDRRFHGIjIAEI EtTCCLSAI^^ ILGFK 
QSTAGGSAGQLDDFDLQTWVRSMFVAKDYLEQQLLELNKRTNNNIRDEI ERSS I LLMSDI SQRLKREI LL 
WEAKHNESTKALKGHX REEEVRQI VKTVLAI YDADKTGLVDFAIiESAGGQI LSTRCTES YQTKS AQI SV 

1 5 FGI PLWYPTNTPRVAI S PNVQPGECWAFQGFPGFLGKS RVNMLKLNSLVYVTGFTIiEHI PKSIiS PTGRI e 
SAPRNFTVWGLEQEKDQEPVLFGDYQFEDNGASLQYFAVQNIaDI 
RVHGKPPAT 

Human MSTP028 rnRNA sequence - varl (public gi: 14042294) 

20 CCCCGCCTCraCCCCCGGCrGGOSTGAGCTGGGTGTTTCCTGCCTCTCTCAGTCCGGGTTTGGAGACT 

TGCGTCCTCC^CTTTTTOTGGAAGAGATGTCAGGAGAAAGTGTGGTGAGCTCAGCGGTGCCAGCGGCTO 

CTACCCGC^CCACTTCCTTCAAGGGCACGAGCCCCAGCT 

CCTCTACTATACCACCATGCAGACGCTGACCAAGCAGGAC^ 

ATGGAAGTGCTCACCGACAGTGAAGGCTGGATCCTCATTGACCGCTGTGGGAAGCACTTTGGTACGATAC 
25 TCAACTACCTTCGAGACGGGGCGGTGCCTTTACCCGAG^ 

AG CCAAGTACTA CCTAGTCCAAGGC CTGGTGGAAGAGTG C(^GG^ 
GAGCCTTTCTGCAAGGTCCCTGTGATCACCTCATCX!AAGGAAGAACAAAAACT^ 

AGCCAGCCGTGAAGTTG CTCTACAAC^GAAGTAACAACAAATACTC^ CTGACGACAA 

TATGTTGAAAAAC^TTGAACTGTTTGATAAGCTGTCTCTGCG 
30 GATGTTAOTGGGGATGAAATCTGCTGCTGGTCCT 

GTACCTCCATCGTCTATGCCACTaAGAAGAAACAGAC 

GGAGACCCTCAACATTTTGCTGTATGAGGCCCAGGATGGCCGGGGACCT 

ACAGGCGGGGCGGCXX^GCGCTCCCACCACCTGGACGAGGACGA 

GGAGGATCCACATCAAGCGCCCTGATGACCGGGCCCACCT^ 
35 CCCTCCTCTCACCGCCCCCACTCCCTGCCGTG^ 

GCTTCCCTTGGAGCCTGGAGATACTTTTGTAACAAGCCAGATGAT^ 

AATTGATTGTCTTGACCCAGGCGTATGACCCCTGTCGTTGAACAAGCTGTG 

TGTATGTGAGAACTTTTGTTTGCAATATTTATTTTTGT^ 
40 GACACTCCCTTAAGGGTT^GTTTGACAATTCTGAGAGTT^^ 

TGAGCTCCCTGCTTCCTATTTCATAATCCTCCAGCCCCAGC^ 

CCCGGGCACAATCCCCACTGCTTTGCTAGACGTGCTTT 

ATAATTGCAGCTTGTGGCAGGGGAAATATGGCTGAATG^ 

GGGTGCAAGGCTTTGTTTAGGGATCAAGCCTTT 
45 GGACCCCATATGTCTGCGTAGGAGCAGAACTTTCCATGGCAGTAAGTGTCCAGCTCTC 

TCCCCAACTCCAGCCXilCGTCCAGTTGTTCTC 

CTGTGACAGGCATAGCTC^TAAACrACCCCTCCCTGGGATCCCQCT 

AGCTGGGCTAACTTTCTAAGTCATT TTGCTTAGAAATTCAGTGTGGCCCATACCCTTTGTCCTCC CAG CC 
TGGCATCCAGGCAGGGACACCCTCACACCACCAGCCCCAGGGAGCrTCCCTC 
50 TTGTCTTTGCCTCTGATTTTTACACAGTGTAGAGTGGCCAGCAGTGAA 

ATAGATAACTTTGGGTCTGGTTTGTGTCKJTGTTCATGTTCGTTTAAGGG 

GGACGTGTGCTTGTGGGGCACAGGTGGCGG CCCCTGCTGGAGCC CGGCTGGG CGCAGCGCCTATGTAGGA 
CGGGTGTTCTCAGTGACCTACCTCCCAGGCTCCTCTGCACCTGCAAAGGAACAGGAGTGA 
ACAGGGGTGGTTGAGACTAGACTAGGTAGAGTAGTTACCAGGAGATGTGAATGTGCGTCAOT 
55 GGGTTTGTCAAGGGAAT CXn^ACCGTTTTATACCAAAGGTATTAACATGGGCAG CCTTTGACACATGTAT 
TCCAAAAACGAGTTTATATTTTCAAACGGTTTTTACAGCTTAGACTTTC 
CAGTTGTATGCCTTCATTTTGTATCCAACAGCAAAGTCTACAATAAAACTTTAAAACA 

Human MSTP028 mRNA sequence - var2 (public gi: 13994352) 

60 GGAGACTCCTGCGTCCTCCGACTTTTCATGGAAGAGATGTCAGGAGAAAGTC 

CAGCGGCTGCTACCCGCACCACTTCCTTCAAGGGCACGAGCCCCAGCTCCAAATA 
GGGTGGAGCCCTCTACTATACCACCATGCAGACGCTGACCA^ 
AGCGGGCGC^TGGAAGTGCTCACCGACAGTGAAGGCTGGATCCTCATTGA 
GTACGATACTCAACTACCTTCGAGACGGGGCGGTGCCTTTACCCGAGAGCCXSCCGGGAG^ 
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G CT AGCAGAAG CCAAGT ACTACCTAGTC CAAGGCCTGGTGGAAGAGTGCCAGGCGG CCCTACAAAACAAA 
GATACTTATGAGCCTTTCTGCAAGGTCCCTGTGATCACCT 
CTTCAAATAAGCCAGCCGTGAAGTTGCTCrAC/^ 
TGACGACAATATGTTGAAAAAC^TTGAACTGTTTGATAAGCTGTCTCT 
5 TTCATAAAGGATCTTATTGGGGATGAAATCTGCTGCTGGTCCTTTTATGGTCAGG^ 
AAGTCTGTTGTACCTCCATCGTCTATGCCACT 

GATTTATGAGGAGACC CTGAACATT TTG CTGTATGAGX3 CCCAGGATGGCCGGGG ACCTGACAATC CGCTC 
CTGGAGGCCACTVGGCGGGGCGGCGGGGCGCTCCCACC^ 
AGOSCGTGCGGAGGATCCACATCAAGCGCCCTGATG^ 
10 ACCGAGCCGGCCTCCTCTCACCX3CCCX:C^CT^ 

GCCCCTTCTGCTTCCCTTGGAGCCTGGAGATACCT 

GACAAGGCAAATTGATTGTC TTGA C CCAGGCGTATGACC CCTGTCGTTGAACAAG CTGTGTCTAAGATCT 
CTACTTTTCATGAGAATCTGAGACT CTTTGGAGCCAGGCTTT CTCGGTTCT CAGAGGAAAAGTATGAATG 
AGTGTGAAGTGTATGTGAGAACTTTTGTTTGCAATATTTATTTTTGTGGGTGTCGACTTCCTATGTGGGC 
1 5 TTTTTGGGTGACACTCCCTTAAGGGTTCAG 

AGAGGTATCTGAGCTCCCTGCTTCCTATTTCATAAT 
TGTGTTTGGCCCGGGCACAATCCCCACTGCTTTGCTAGACGTGCT^ 

AGCTTGTTGATAATTGCAGCTTGTGGCAGTGGAAATATGGCTGAATGAGCGT CTAAAT CGTTGAGACCAG 
TGC^U^CTTTGGGTGCAAGGCTTTGTTTAGGGATCAAGGCTTTTGCCACCTTGGGCTGGT 
2.0 TGCTCACTGGGACCCCATATGTCTGCGTAGGAGC^G^ 

CTGGTTCTTTCCCCMCTCCAGCCCCGTCCAGTTGTTCTCCTGATTGACCCGACT 

CATCTGACCCTGTGACAGGCATAGCTCATAAACXACCCCTCCCTGGGATCCCG 

TCCCCATGAAGCTGGGCTAACTTTCTAAGT(^TTTTGC 

CTCCCAGCCTGGCATCCAGG(^GGGACACCCTCACACCACCAGCCCCAGGGAGCTTCCCTGCTA 
25 C^GACCCCCTTGTCTTTGCCTCTGATTTTTACACAGTGTAGAGTGGCCAGCAGTGAAC7VGGTTGAGGATC 

TGCGGGTAGATAGATAACTTTGGGTCTGGTTTGTGTOTGTGTTCATGTTTGT 

TGTGGGTGGGGACX5TGTGCTTGTGGGGCACAGGTGGCGGCCCCTGCT 

TATGTAGGACGGGTGTTCTCAGTGACCTACCTCCCAGGCTCCTCTGCACCT 

TOSTGACTGACAGGGGTGGTTGAGACTAGACTAGGTAGAGTAGTTAC^ 
30 GTGATGGATGGGTTTGTCAAGGGAATCGTTACCGTTTTAT^ 

CACATGTATTCCAAAAACGAGTTTATATTTTCAAACGGTTTTTACAGCrrTAG^ 

TGCCTGTGACAGTTGTATGCCTTC^TT^ 

ATGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA7VAAAAAAAAAAAAAA .... 

35 Human MSTP028 mRNA sequence - var3 (public gi: 25303941) 

CCGGGTTTGX3AGACTCCTGCGTCCTC^ 

AGCGGTGCCAGCGGCTGCTACCCGCACCACTTCCTTCAAGGGCACGAGCCCCAGCTCC^^ 
CTGAATGTGGGTGGAGCCCTCTACTATACCACCATGC^ 
CCATGTTCAGCXSGGCteCATGGAAGTGCTCACCGAt^ 
40 GCACTTTGGTACXIATAC^CAACTACCTTCGAGAC^ 

GAGGAGCTGCTAGCAGAAGCCAAGTACTACCTAGTCCAAG^ 

AAAACAAAGATACTTATGAGCCTTT CTGCAAGGTCCCTGTGATCACCTCAT C CAAGGAAGAACAAAAACT 
TATAGCGACTTCAAATAAGCCAGCCGTGAAGTTGCTCT^ 
AGCAATTCTGACGACAATATGTTGAAAAACATTGAACTCT 
45 GGGTCCTGTTCATAAAGGATGTCAT TGGGG ATGAA^ CTTTTATGGTCAGGGCCGGAA 
GATTGCTGAAGTCTGTTGTACCTCCATCGTCTATGCCACTGAGAAGA 
GAAGCCCGGATTTATGAGGAGACCCTGAACATTTTGCT^ 

GCGGATO^GCGCXSTGCGGAGGATCCACATCAAGCGCCCTGATGACCGGGCCCACCT 
50 GGCAAGAGACCGAGCCGCCCTCCTCTCACC^CCCC^ 

GCTGCCGGGCCCCTTCTGCTTCCCTTGGAGCCTGGAGATACT 

TATTGCTTGACAAGGCAAATTGATTGTC TTGACCCAGG CGTATGACCCCTGTCGTTGAACAAGCTGTGTC 

TAAGATCTCTACTTTTCATGAGAATCTGAGACTCTTTGGAGCCAGG^ 

TATGAATGAGTGTGAAGTGTATGTGAGAACTTTTGrrTGCAAT^ 
55 ATGTGGGCTTTTTGX^TGACACTCCCTTAAGGGTTCAGTTTG^ 

AGGCCACCAGAGGTATCTGAGCrrCCCTGCTTCCTATTTCATAATCCTCCAGCCCCA 

GGTTCCTGTGTGTTTGGCCCGGGCACAATCCCCACTGCTT^ 

GGGCCTAGAGCTTGTTGATAATTGCAGCTTGTGGCA^ 

GAGACCAGTGCAACTTTGGGTGCAAGGCTTTGTTTAGGGATCAAGCC^ 
60 TGGCCTGGTGCTCACTGGGACCCCATATGTCTGC^ 

CTCTGTTTCTGGTTCTTTCCCCAACTCCAGCCC^GTCCAGTTGTTCTCCTGA 

AGGAAGGCCATCTCACCCTCTGACAGGCATAG 

AGCCTCCTTCCCCATGAAGCTGGGCTAACTTTCTAAGTCATT 

CCTTTGTCCTCCCAGCCTGGCATCCAGGCAGGGACACCCTCACACCACCAGCC 
65 TATAAACACAGACCCCCTTGTCTTTGCCTCTGATTTTTACACAGTGTAG^ 
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TGAGGATGTGCGGGTAGATAGATAACTTTGGGTCTCGTTTGTGTCT 
TGTGACTGTGGGTGGGGACX3TGTGCTTGTGGGGCACAGGTGGCCCCTGCT 
GCCTATGTAGGACGGGTGTTCTCAGTGACCTACCTCCCAGGCTCCTCT^ 
GAGTCGTGACTGAGAGGGGTGGTTGAGACTAGACTAGGTAGAGT 
5 CAGGTGATGGATGGGTTTGTCAAGGGAATCGTTACCGTTTTATACC^^ 

TGAC^CATGTATTCCAAAAACGAGTTTATATTTTCAAACGGTTTTTACAG 
CCCTXSCCTGTGACAGTTGTATGCCTTCATT^ 

ATCATGACTGAATGTCAAAATCGTGTATTGGG CAGATG CTTTTT AAACTGTCGTGTGAGAAACTT TTATA 
TTAGGCC^TTTGGATTTTATTAAGTGCTAAGGAAAGAGGGCTTA CAAAATGTTTCGTAAATATTT TAT AC 
10 TGTTTAAGTGTTAAACA CCAACCCTCTCTTTCT TTTC^ CGAAGTGAATGT 
TGGCCAGGAAAATGGAAAAGO^TTGTATAAATTTTTTTTTGAGGCGGAGT 
TGGAGTGTAGTGGCACCATCTCCACTTACCACAACTTGTGCCTCCT 

AGCCTC CCX1AGT AGCTGGGATTGCAGGTACCCATCAGC CCATGCCCAG CTAATTTTGTATTT TTAGTAGA 
GATGGGGTTTCACCATCTTGGCCAGGCTGGTCTTGAACT 
1 5 CCAAAGTGCTGGGATTACAGGTGTGAGTCACCACACCTGGCTGCA^ 

AATGAGTTTGTGGAACAATTTGATTTGCTGTGGCCTOTATGCCTAATGAGCT 
CTCTACCXIAACrTTGCACTTGTAGTTTTGAGTCTTTGTCTCTCT 

CTGTTATTGAGTATACAAGGAACTGATTTTTCTTATGTTAGCA 
20 GTAAGCACTATCCAGGTAAAACACTGGCCCAAGATTTGGTAAAGA 

AGTTTTTTACAAATTGGAACAG CTTTGGTG TGT CGTAATCAAGGG TTTTTTT TGTTTGTTTGTTTCAAAT 
AAGCCATCTGATTGTGGTGACrGGGGCCCATGTCCAAGACAATTCCTGGCZATATTCTGTCACCCT 
GGGGCGATCACTGTGTGGGGACCCCATTCCCCAGTTAAAGTGTGTCTCTGTAC 
GGACCCAAGTGTQAACAACACTCAGCCCGCCCTCTGGAGCGTGTGCT^ 

2 5 CACTGTAACAGTTAAGTGTGTCATTAACCTTTCTGTCTCTTTGCGCCATAAAAAAATGCTCAAAG 

GATGTAGCCACTGTATGTTGTACAAACGTTGGCGACAT^ 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

Human MSTP028 mRN A sequence - var4 (public gi: 16552440) 

30 AGTCCGGGTTTQGAGACTCCTGCGTCCTCCGACT 
CAClXiTTGGGTGCCATGGAGTTGGGGAGAGT^ 
CATGACTCTCGCGTTGCTTCTGCTGTTGCAAGCT 
CGCTTOTCTTCTAC ^ GTTATAGTTCTAGTACC ^ TT 

TTTTTCAGTTTCAGGGAATAGTTCAAACAAGTTATGTGCTGTC^ 

3 5 AGCATACCTGTAGTCAAGCAAAGTTGGGTTTATTTCCT 

ATCTCAGAGAAGGGCTGTGGGCTTGTGTTGGGTGATTTGAGAGACAGTTCAGAG 

TGTGTTGGATG CTGCTGGGAAGCAGGGCTAATTCTGTGAT TGGGTCTCAGTGATT CCTGACTTGAAAGCA 
GGAAGAATGGAAGGAGG CTAAACTTCTCATTGGTAAAG CAG CAG CTGTAACTCCTATTAGCCAGGATAGG 
GGATCTTTGGTCATTTTTGTATTTTGGATAATGTTTATGTTTTO 
40 TGGTCCTGTTTTTGTCTTGTCG CAAGGGCACAGAGTGG CCTTGTCTGAG GGTGATGTGCTGTGAAAAACT 
GTTGATGTTCAATGGGAATGGTAGGGCCAGCCGTGGGGGC^^ 
CCTTGCACATTTCCACCTCTACAGTTTTACCTGTTCATTTCAGACATGTTO 

CGGATACCAATCTCACTTTCCAGGCCTG CGTAAATCAG CCACTGTATCCATT TCTTTGAGATGTACAGAG 
^ AGTCAGCCATGCTATCAGGGAGATGGTAGTGGGATCTTGCTC 

CAGGAAGTGATAGGAGTGCGAGCTGGAATCCCATTCAACTTC^ 
TGAAAAATGACACTTAGCTAGCTATTGAGTGGTACATC 

GCTTTAGGCTGTTTTAGGGTGCAGCCAGGGTGTTCATGTATACAGGTGCTAGGCAGAAA 
^ TAACACAAGAGTTAGGGGCACCCTTGTGCCTGCAGGGTCGAC^ 

CTCTCCTTAGTTGACCACATTGGAAGCAAACTTTTAAATGCTGTGTA 

CnGGAGGCCAGATTGAATCCACAGGCTGAAAGCAGTCAACCAGG C CTGATGTCATGACCCTGTAT CCTCT 
CCACTGGCAGGAAGAGATGTCAGGAGAAAGTGTGGTGAGCTCAGCGGTGCCAGCGGCTGCTACrc 

5 5 ACTTCCTTCAAGGGCACGAGCCCCAGCTCCAAATAC^ 

CCACCATGCAGACG CTGACCAAGCAGGA CACCATGCTGAAGG CCATGTTCAGCGGGCGCATGGAAGTG CT 
CACCGACAGTGAAGGCTGGATC CT CATTGACOG CTGTGGGAAGCACTTTGGT ACGATACTCAACTACCIT 
CGAGACGGGGCGGTGCCITTACCCGAGAGCCGCCGGGAGATCGAGGAGCrG 
ACCTAGTCCAAGGCCTGGTGGAAGAGTGCCAGGCGGCCCTACAACAGA 

60 CTGCAAGGTCCCTGTGATCACCTCATCCAAGGAAGAACAAAAACl^ 

GTGAAGTTGCTCTACAAOU3AAGTAACAACAAATACTCATATACCAGCAATTCT 

AAAACATTGAACTGTTTGATAAGCTGTCTCTGCGCTTTAACGGAAGGGTCCT 

TGGGGATGAAATCTGCTGCTGGTCCTTTTATGGTC 

ATCGTCTATGCCACTGAGAAGAAACAGACCAAGGTGGAGTTTCCCGAAGCCCGGATTTA 
65 TGAAC^TTTTGCTGTATGAGGCCCAGGATGGCCGGGGACCTGACAATGCGCTCCTGGAGG 
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GGCGGCGOTGCGCTCCCACCACCra^C 

CACATCAAGCGCCCTGATGACCGGGCCCACCTCCACCAGTGAGC^ 

TCACCGCCCCCACTCCCTGCCGTGCTAC^CCCAGATCCTGTGCAGGCTGCCXSGGCCCCOT 
TGGAGCCTGGAGATACTTTTGTAACAAGCCAGATGATTATTTTGGTATTGCTT 
5 GTCTTCACCCAGGCGTATGACCCCTGTCGTTGAAC^GCTGTGTCTAAGATCTCTACTTTTCATGAGAAT 
CTGAGACTCTTTGGAGCCAGGCTTTCTCGGTTCTCAGAGGAAAAGTATGAA 

Human MSTP028 mRNA sequence - var5 (public gi: 21750697) 

GCTGGCGTGAGCTGGGTGTTTCCTGCCTCTCTCAGTCCGGGTTTGGAGACT 
1 0 CATGGAAGAGATGTCAGGAGAAAGTGTGGTGAGCTCAGCGCT 

TTCAAGGGCACGAGCCCCAGCTCCAAATACGTGAAGCTGAA 

TGOVGACGCTGACCAAGCAGGACACCATGCTGAAG^ 

CAGTGAAGAACAAAGATACTTATGAGCCrCTTCT 

AAAACTTATAGCGACTTCAAATAAGCCAGCTOTGAAGTTGCTCTA 
1 5 TATACCAGCGATTCTGACGACAATATGTTGAAAAACATTGAACTGTTTGA 

ACGGAAGGGTC C TGTT CATAAAGGATGTTATTGGGGATGAAATCTGCTG CTGGT C CTTTTATGGT CAGGG 

CCX3GAAGATTGCTGAAGTCTGTTGTACCTCCATCGTCTATGC 

TTTC CXX3AAGCCCX3GATTTATGAGGAGACC CTGAACATTTTGCTGTATG AGGCCCAGGGTGGCCGGGGAC 
CTGACAATGCX5CTCCTGGAGGCCACAGGCGGGGCGGCGGGGCGCTCCCACCACCTGGACGAG 
20 GCGGGAGCGGATCGAGCGCGTGCGGAGGATCCACATCAAGCGCCCTGATG^ 



GTGCAGGCTGCCGGGCCCCTTCTGCTTCCCTTGGAGCCTGGAGATACTTTTGTAACAAGCCAGATGATTA 

TTTTGGTATTGCTTGACAAGGCAAATTGATTGTCTT 

TGTGTCTAAGATCTCTACTTTTCATGAGAATCTGAGA^ 



25 




AGTTGGAGGCCACCAGAGGTATCTGAGCTCCCTGCTTC 

ACTCCTGGTTCCTGTGTGTTTGGCCCGGGCACAATCCCCACTGCTTTGCTAG^ 

GGCTTTGGGCCTAGAGCTrGTTGATAATTGCAGCOT 
30 ATCXSTTGAGACC^GTGCAACTTTGGGTGCAAGGCTO 

GGTCTTTGGCCTGGTGCTCACTGGGACCCCATATGTCr 

GTCCAGCTCTGTTTCTGGTTCTTTCCCCAACTC 

CACTCCAGGAAGGCCATCTGACCCTGTGACAGGCATAGC 

CTCTTCAGCCTCCTTCCCCATGAAGCTGGGCTAACTTTCTAAG 
3 5 CC^TACCCTTTGTCCTCCCAGCCIXXSCAT^ 

CCCTGCTATAAACACAGACCCCCTTGTCOTTGCCT 

ACAGGTTGAGGATGTGCGGGTAGATAGATAACTTTGGGTCTGGTTTGTGTCTGTGTTCATC 
GGGATATGTGTGACTGTGGGTGGGGACGTGTGCTTGTGGGGCACAGGTGG03GCCCC 
CTGGGCGCAGOGCCTATGTAGGACGGGTGTTCTCA^ 
40 GGAACAGGAGTGAGTCGTGACTGAC^GGGGTGGTTG 
TGAATGTGCGTCAGGTGATGGATGGGTTTGTCAA 

TGGGCAGCCTTTGAC^CATGTATTCCAAAAACGAGTTTATATTTTCAAACGGTT^ 

TGTACTTACTGCCCTGCCTGTGACAGTTGTATG CCTTCATTTTGTATC CAACAGCAAAGTCTACAATAAA 
ACTTTAAAACAATCATG 

45 

Human MSTP028 Protein sequence - varl (public gi: 13994353) 

MEEMSGESWSSAVPAAATRTTSFKGTSPSSKYVKLNVGGALYYTTM 

3 EGW1LX DRCX3KHFGTI IJJSnjRD GAVPLPESRREI EELIAEAKY YliVQGLVEECQAALQNKDTYEPFCKV 
PVITSSKBEQKLIATSNKPAVKLLYNRSNNKYS YTSNSDDNMLKNI ELFDKL SIjRFNGRVLF IKDVIGDE 
50 ICCWSFYGQGRKIAEVCereiVYATEKKQTKVEFPEARIYEET 
RS HHLDEDEERERI ERVRRI HI KRPDDRAHLHQ 

Human MSTP028 Protein sequence - var2 (public gi: 14042295) 

M S GES WS S AVP AAATRTTS FKGTS P S S KYVKL WGGAL YYT TMQTLTKQDT^KAMFSGRMEVliTDSEG 
55 WILITOCXSKHFGTIIiNYIiRTCAVPLPESRREIEEUiAEAKYYLVQGLVEE 
TSSKEEQKLIATSNKPAVKLLYNRSNNKYSYTSNSDDNMLKNIELFDK^ 

WS FYGQGRKIAEVCCTS I VYAT EKKQTKVEFP EAR I YEETLN I LL YEAQBGRGPDNAUjEATGGAAGRSH 
HLDEDEERERIERVRRIHIKRPDDRAHLHQ 

60 

Mouse MSTP028 mRNA sequence (public gi: 13905271) 

CTGGGTGTTTCCTGCCTCTCAGTCCGG^ 

TCAGGAGACAGCGTGGTGAGCTCAGCGGTGCX^GC&GCTC 

GCCCCAGCTCCAAGTACXyrGAAGCTGAACGTGGGGGGCGCrCTCT 
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CAAGCAGGACyvCCATGCTGAAGGCCATGTTCAGTGGACGCATGXS^ 
ATCCTCATTGACCGCTGTGGGAAGCACTTTGGGACAAT^ 

TGCCTGAGAGCCGCCGGGAGATCGAGGAGCTGCTAGCCGAGGCCAAGTACTACCTGGTGCAGGGCCTGCT 
GGAAGAGTGCCAGGCCX3CCCTACAAAACAAAGATACTTACGAGCCCTTCTGCAAGG 
5 TCCT CCAAGGAAGAACAGAGGCTTATAGCGACGTCAAATAAGCCCGCCGTGAAG CTGCTCTACAACAGGA 
GCAACAACAAGTACTCCTACACCAGCAATTCTGACGATAACATGTTGAAAAACATC 
GCTGTCCCTGCGCTTCAACGGGCGGGTCCTGTTCATTAAGGAC^ 

TCATTTTACGGCCAGGGCCX3GAAAATCX3CTGAAGTCTGTTGTACCTCCATCGTOTAOT 

AGCAGACCAAGGTTGAGTTCCCTGAAGCCCGCATTTATGAGGA 
1 0 CC^GGATGGCCX^GGACCTGACAATGCCCTCCTGK^ 

CTGGACGAGGACGAGGAGCGGGAGCGGGAGCGGATCGAGCGCGTGAGGAGGATCCATATCAAGCGCCCAG 

ATGACCGGGCCCACCTCCACCAGTGAGCAGGCCGAGCACCCTGCCTTCTGCCCTCCCTCTGCTCCTGCCC 

CGCCCCCTCAGACCCTGTGCAGGCTTTGGGGCACCTCCC^CTTCCCCTGGAGTCTGAGACACTTTTGTAA 

CAAGCCAGATGATTATTTTGTTATTTCTTGACAM 
1 5 CirCTGAAC^TGCTCTGTCTGGGGTCCCACCT 

AGGAAAACGTATGAATGAGTTTGGCGTGTATGTGAGAACCT^ 

GACTACCTATTAGGGCCTCTTAGGTGACACTCCC^ 



20 



40 



CTCT CCTGTGAGTCTGCTCTGTGTGCX3GTTTTGTACAGAGC CCC TTGGG CACAGTGGAAGGGGGACGAAG 



GCAGTTGGCTGCTCTCTCGGCTCAGTCCTrTT^ 
TAAGGGTCTACT CCTCTCTTGCATGGCTTCCTTTTCTCAT CAAGGGGGTGCCTTGCCCTTTGGTTAGGAA 
CTGGGGTGACCGC^GACACTGGCACCAGGTGAGGACACCACTGAGTTTGCAGCAGTTTCCCTGC^GAATA 
GAGTGGGCATGCAGATACCTTTGGGTCITGTTGTGTTCAGGGGATGTGTGTGTGA 
25 GTGACTGTAGGGCGCAGGCGCTCACGACAACAGTCAGTTACCT 

GGCTGAGTAGACTAGTTTCAAGGTTAGGTGAGAGTGTGGGATGGATTTGGCTGCTTGATAAAGGAAGTGT 
TGGCTATTTTTTAGACCAAAGGTATTAAGTGGGCAATCTCTGACAGGTTTTAAAATTTCTATTTAGCGAC 
GGTTTGTACAGCTCGCTTTGTACATGACAGTTGTATACC^ 
AAATACTCAAACAATCCCGACTGGATCGAGGAACCATC 
30 AAGGGAGCTTACGAATCATGTTTCCTACGTTTTTATACTGTTTC 

GTTCAG CTT TTC TAGGAAGTGAATGTCAGTAOTGGTGTTTT CTATAGGAATGGAGAAGC CATGTATACAC 
TGTGTAAATGCTCATGTGAGAATGACCTAGCGGCACAATCTGACTTGCCTTGGCCTCTGGC 
ACTGTTTTTGGCAGCTCTCTCTACCTTCCTCTATCCTCAAACCT 
CCCAGGGATAGGAACAGACCTAGTGAACATTCCACGGTGCCTGAT 
3 5 GGCCTGACCCAGCGTCAGTCTCCAAAGCTCTGCTTCCGGAT^ 
AGCACTTCTAGATCACOVTCTAGTGAGTCGCTGGTC 

AATT AAAATAAACCAGAATAGCTTTTTGCTCATGGT AA CCAAGT TCAGTGTCTGTGGGGCCCACAGCCTG 
GCAGGTCTGGCCCAGCTCrCTGTCACCTGCTGTGGGAGATGGACCGTTTGACCTCTCTGGAGGCTGAGAC 
CCCATACCTGCTGCTCAGTGCTGGTAATCAGCCCTCCCCAGAGTGCGTGCCGGCCAGAGGGCTCCACCCA 



AGAAGTGCTCACTCTTGTACGAGTGTCTGCGGACATGTGTAAAATAAA 
AAAAAAAAAAAAAAAA 



Mouse MSTP028 Protein sequence (public gi: 13905272) 

45 MEEMSGDSWSS AVPAAATRTTSFKGAS PS SKYVIQjNVGGALYYTTMQTLTKQDTMLKAMFSGRMEVLTD 
SEGW ILIDRCGKHFGTI LNYLRDGGVPL PESRRE I EELIAEAKYYIiVQGIjIjEECQAAI^NKDTYEPFCKV 
P VI TSS KEEQRIiIATSNKPAVKLLYNRSNNKYS YTSNS DDNMLKNI ELFDKLSLRFNGRVLF I KDVIGDE 
ICCWSFYGQGRKIAFWCCTSIWATEKKQTKVEFPEARIYEETLNILL^ 
RS HHliDEDEERERERI ERVRRIHI KRPDDRAHLHQ 

50 

Drosophila MSTP028 mRNA sequence (public gi: 24585830) 

CTTAAAATATAAATTGAAATCGATGGGTCGTTTC 
AACAATAAATACATACATTAAATATACCTTTAATACAATA 
55 TAATCACTATTTOSCCGAATTTGCATAAACGCATGAGA^ 

AGTAGCTGAACCCCGAAAGCTAGAGAACAAAAAGTATAAATGTCGGAATCAA 

TGGAA CACTTGACGAAAAACAATGACACAATGCTGAG CG CAATGTTTAGTGGTAGAATGGAAGTGCTGACT 
GATT CGGAAGGATGGAT TTTAATCGATCGCTGTGGAAAT(^TTTTGGTATCATT CTTAATTATTTAAGAG 
60 ATGGCACTCTTCCATTACCAGAAACTAACAAGGAA^ 
CATTACCGAGTTAGCTATTTCTTGTGAACX3GGCACr 

ATTCCACTTATAACTTCACAAAAGGAAGAGCAG CTTTT ATTAAGTGTTTCTTTAAAGCCGGC TGTTATTC 

TTGTGGTTCAGCGGCAGAATAACAAGTATTCGTACACAAGT^ 

TGAATTATTTGATAAGCTTTCTTTGCGCOT 
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AGTGAAATCrGCTGCTGGTCATTTTACGGACACGGAAAAAAAGT^^ 

TTTATG CAACTGACAGAAAGCATACTAAAGTTGAATTTCCGGAAGCTCGTATATACGAAGAAACGCTACA 
AGTCTTGCTTTATGAAAACCGCAATGCACCAGACCAAGAGCT 
GGAAGTGC^VAGTGGAACCAGCATTAATCAGTAC^CAAGCXIATGAGG^ 
5 GATTACG&TCCAACAAGCGTAATAATCCGTCCTGATTT^^ 

TTTTATAAACGATTAAAAAAAATATATATATGTTTTCTTTTCTTAATAAAA 

Drosophila MSTP028 Protein sequence (public gi: 19921650) 

MSESMSGDHKILLKGHSSQYIiKIiWGGHLYYTTIGTLTm 
1 0 HFGIILNYIjRDGTVPLPETNKEIAEIiIiAEAKYYCITEI^ 

I,SVSLKPAVILWQRQNWKySYTSTSDDNLLKNIELFDKLSI.RFNKRII*FIKDVIGPSEICOTSFYGHGK 
KVAEVCCTS I VYATDRKHTKVEFPEARI YEETLQVLL YENRNAPDQEIiMQATSSARVGSASGTS INQYTS 
DEEEERTGLARLRSNKRNNPS 



15 Human HERPUD1 mRNA sequence - varl (public gi: i65078oi) 

AGAGACGTGAACGGTCGTTGC^GAGATTGCGGGCGGCTGAGACGCCGCCTGCCTGGCACCT 
GCGGAGCCCCGACAC<^CCGCCGCCGCCATGGAGTCCGAGACCGAACCCGAGCCCGTCACGCTCCTGGTG 
AAGAGCCCCAACCAGCGCCACCGCGACTTGGAGCTGAGTGGCGACCGCGGCTGGAGTGTGGGCCACCT 
AGGCCCACCTGAGCOTOTTCTACCCCGAGCGTCCGCGTCCAGAGGACCAGAGGTTAATTTATTC 
20 GCTGTTGTTGGATCACCAATGTCTCAGGGACTTGCTT^ 

TGCAATGTGAAGAGTCCTTCAAAAATGCCAGAAATCAACX5CCAAGGTGGCTGAATCCACAGAGGAGCCTG 
CTGGTTCTAATCGGGGACAGTATCCTGAGGATTCCT 

GAACCTTTCTTCCCCTGGATGGGAAAAGATCTCAAGGCATCACGTTGGGTG 

CCGGTTCAGAAC TTCCCAAATGATGGTCCTCCTCCTGACX3TTGTAAAT CAGGACCCCAACAATAACTTAC 
25 AGGAAGGCACTGATCCTGAAACrrcAAGACCCCAACCACCrCCCT 

GCAGACCAGCCCCTCCTT TATGAGCACAGCATGGCTTGTCTTCAAGACTTTCTT TGCCTCT CTTCTTCCA 

GAAGGCCCCCCAGCCATCGCAAACTGATGGTGTTTGTGCTCTAGCTGTTGGAGG 

CTGGATCACCTGACTCCAGCTAGATT^ 

GGATGATGATATGCTTTTGTGAGCAAGCAAAAGCAGAAACGTG^ 
30 • AAATGCCCAAGGCTTCTCATGTCTTTATTCTC 

CTGTACGTAGAAGGCCTTAGGTGTTGCATGTCTATGCTC 

TTTAAATTACACTAAGTGTACTACTTTA^ 

CTAGGAAAGACTTATGTATAATTGCTTT TTAAAATG CAGTGCTTTACTT TAAACTAAGGG GAACTTTGCG 
3 5 GAGGTGAAAACCTTTGCTGGGTTTTCTGTTCAATAAAGTTTTACTATGAATGAC 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^ 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

Human HERPUD1 111RNA sequence - var2 (public gi: 10441910) 

40 GCTGTGTGGrcCAGGCTTTTCTCAAACTCCTGA 

TGGGACTAC^GGC^TGTGCCACTAGACCTGGCTCTAAA 

C^TTTCACAATGTTTATTCACATATATGGTATTAGTATTCTAATGXAGTGATGCACTCTAAATTTG 

ATATTTCCTAGAACATCTGAACAGAGCATAGGAAATTC^ 

CTTAAAAGCACT^TATCATTTCATTTCCCTGCACT 
45 CAAGGCTTACTTATATTGGAATACTATTTTAGAAAGTTGTGGGCT 

TCAGATGTCTGCAATGAGTAAATTTAGCACCATTAT CAGGAAGCTTTCTCACCAATGACAACT TCATTGG 

AAGATTTTAATGAAAGTGTAGCATACTCTAGGGAAAAAATATGAATATTTTAGCATCTATGTAT^ 

TTATGTTGAATAAATGTCAGACTATTTT TTACATAACGTTG CTTCTGTTTAATTTTGTCACGTTCAGAGG 

TGGGGGGTAGGAGATGTAAGCCCTTGACAGCAAAATAATTCCTTTTGeTTGAT^ 
50 GCTCCTTTGTTCTGTGTTCATGTTACACTTATTTAGGTGGCTGAATCCA 

ATCGGGGACAGTATCCTGAGGATTCCTCAAGTGATGGT^ 

TTCCCCTGGATGGGAAAACATCTCAAGGCCTGAAGGTGCCCAGCAOT 

TTCTCCGGTTACACACCCTATGGGTGGCTTCAGCTTTC 

ACATGCAATATTTAGCAGCCACTGCTGCATCAGGGGCTTTTGTTCC^ 
55 TGTGGTCTCTGCACCTGCTCCAGCC 

AATGCTGCTCCTCAAGTGGTTGTTAATCCTX3GAGCCAATCAAAATT 

CTATTGTGGAAGAAGATGATGAAATAAATCX3AGATTGGTTGGATTGGA 

TGTTTTTCTCAGTATCCTCTACTTCTACTCCTCCCTG 

GTTATGTACCTGCATCAC^TTGGGTGGTTTCCATTTAGACCGAGGCCG^ 
60 GTCCTCCTCCTGACGTTGTAAATCAGGACCCCAACAATAACT^ 

AGACCCCAACCACCTCCCTCCAGACAGGGATGTACTAGATGGCGAGCAGAC 

ACAGCVVTGGCTTGTCTTCT^GACTTTCT TTG CCTCTCITCTTCCAGAAGGCCCCCCAGCCATCG CAAACT 
GATGGTGTITGTGCTGTAGCTGTTGGAGGCTT TGACAGGAATGGACTGGATCA CCTGACTCCAGCTAGAT 
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TGCCTCTCCTGGACATGGCAATGATGAGTTTTTAAAAAA(^GTGTGGATGATGATATGCTTTTGTGAGCA 
AGCAAAAGCAGAAACGTGAAGCCX3TGATACAAATTGGTGAACAAAAAATGCCCAAGG 
TATTCTGAAGAGCTTTAATATATACTCTATGTAGTTT^ 
GCATGTCTATGCTTGAGGAACTTTTCCAAATGTGTGTGTCT 
5 ATGCAGAAGTGGTTCTGCTGGTACGATTTGATTCCTGTTGGAATGTTTAAATTACACTAAGTGTACTACT 
TTATATAATCAATGAAATTGCTAGACATGTTTTAGCAGGACTTTTCTAGGAAAGACTTATGTATAATTGC 

CTGTTCAATAAAGTTTTACTATGAATGACAAAAAAAAAAAAAAAAAA 

10 Human HERPUDl mRNA sequence - var3 (public gi: 3005722) 

GGCCACCTa^AGGCCCACCTGAGCCGCGTCTACCCCGAGCGTCCGCGTCCAGAGGACCAGAGGTTAATTT 
GCATCTGGTGTGCAATGTGAAGAGTCCTTCAAAAATGC 

GAGGAGCCTGCTGGTTCTAATCGGGGACAGTATCCTGAGGATTCCTCAAGTGATGGTTTAAGGCAAAGGG 
1 5 AAGTTCTTCGGAACCTTTCTTCCCCTGGATGGGAAAACATCTCAAGGCCTGAAGCTG 

CCAAGGCCTGGGTCCTGGTTTCTCCGGTTACACACC 

ATATATGCACGACAGTACTACATGCAATATTTAGCAGC(^ 

CAAGTGC^CAAGAGATACCTGTGGTCTCTGCACCTGCTC 

AAACCAGCCTGCCAATCAGAATGCTGCTCCTCAAGT 
20 ATGAATGCACAAGGTGGCCCTATTGTGGAAGAAGATGATG 

ATTCAGCAGCTACATTTTCTGTTTTTCTCAGTATCCTC 

GGTCATGGGGGCCACCGTTGTTATGTACCTGCATCACGTTGGGTGGTTTCCATTTAGACCGAGGCCGGTT 

CAGAACTTCCCAAATGATGGTCCTCCTCCTGACGTTGTAAATCAGGACCCCAAGAATAACTTA 

GCACTGATCCTGAAACTGAAGACCCCAACCACCTCCCTCCAGACAGGGATGTACTAGATGGCGAGCAGAC 

25 CAGCCCCTCCTTTATGAGCACAGCATGGCTTGTCTT^ 

CCCC CAG CCATCGCAAACTGATGGTGTTTGTGC TGTAGCTC 
CACCTGACTCCAGCTAGATTGCCTCTCCTGGACATGGCAATGATGAGTTT^ 
TGATATGCTTTTGTGAGCAAGCAAAAGCAGAAACGTGAAGCCGTGATACAAAT 
CCAAGGCTTCTCATGTCTTTATTCTGAAGAGCTTTAATATA 

30 GTAGAAGGCCTTAGGTGTTGCATGTCTATGCTTGAGGAACTTTTCCAAATGTGTGTGT 
TTGTAC^TAGAAGTCATAGATGCAGAAGTGGTT'CTGCTGGTA 
TTACACTAAGTGTACTACTTTATATAATCAATGAAATTGCT 
AAGACTTATGTATAATTGCHTXTTTAAAATGC^G 

AAAACCTTTGCTGGGTTTTCTGTTCAATAAAGTTfTAOTATGAATGACCCTGA 
35 AAAA 

Human HERPUDl mRNA sequence - var4 (public gi: 21619176) 

CCACGCGTCCGGGTCGTTGCAGAGATTGCGGGCGGCTGAGACGCCGCCTGCCTGGCACCTAGGAGCGC^G 
CGGAGCCCCGACACCGCCGCCXSCCGCCATGGAGTCCGAG^ 
40 AGAGCCCCAACCAGCXSCOVCCGCGACTTGGAGCTGAGTGGCGACCGa^CTGGACT 

GGCCCACCTGAGCCGCGTCTACCCCGAGCGTCCGCGTCCAGAGGACCAGAGGTTAATTTATTCTGGGAAG 
CTGTTGTTGGATCACCAATGTCTCAGGGACTTGCTO 

TGTGCAATGTGAAGAGTCCTTCAAAAATGCCAGAAATCAACGCCAAGGTGGCTGAATC 
^ TGCTGGTTCTAATCGGGGACAGTATCCTGAGGATTC^ 

TGGGTCCTGGTTTCTCCGGTTACACACCCTATGGGTGGCrrTCAGCTTTCCTGG 
ACGACAGTACTACATGCAATATTTAGCAGCCACTGCTC 

CAAGAGATACCTGTGGTCTCTGCACCTGCTCCAGCCCCTATTCACAACCAGTTTCC^ 
CTGCCAATCAGAATGCrrGCTCCTCAAGTGGTTGTTAATCCTGGAGCCA^ 
50 ACAAGGTGGCCCTATTGTGGAAGAAGATGATGAAATAAATCGAGATTGGTTGGATTG 
GCTACATTTTCTGTTTTTCTCAGTATCCTCTAC^ 

GGGCCACCGTTGTTATGTACCTGCATCACGTTGGGTGGTTTCCATTTAGACCGAG 
CCCAAATGATGGTCCTCCTCCTGACGTTGTAA^ 

C CTGAAACTGAAGACCCCAACCACCTCCCTCCAGACAG GGATGTACTAGATGGCGAGCAGACCAG CCCCT 
55 CCTTTATGAGCACAGCATGGCTTGTCTTCAAGA 

CATCG CAAACTGATGGTGTTTGTGCTGTAGCTGTTGGAGGCT^^ CTGAC 

TCCAGCTAGATTGCCTCTCCTGGACATGGCAATGATGAGTTTTTAAAAA 

TTTTGTGAGCAAGCAAAGCAGAAACGTGAAGCCGTGATACAAATTGGTGAACAAA^ 

CTC^TGTCTTTATTCTGAAGAGCTTTAATATATACTCTATGTAGTTTAATAAGCACT 
60 C TTAGGTGTTGCATGTCTATG CTTG AGGAACTTTTCCAAATGTGTGTGTCTGCATGTGTGTTTGTACATA 

GAAGTCATAGATGCAGAAGTCX3TTCTGCTGGTACGATTTGATTC 

GTGTACTACTTTATATAATCAATGAAATTGC^ 

GTATAATTGCTTTTTAAAATGCAGTGCTTTACTTTAAACI^ 

GCTGGGTTTTCTGTTCAATAAAGTTTTACn'ATGAATGACCCTGAAA 
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Human HERPUD1 mRNA sequence - var5 (public gi: 14249882) 

AACGGTCGTTGCAGAGATTGCGGGCGGCTGAGACGCCGCCTGCCTGGCACCTAGGAGCGCAGCGGAGCCC 
CGACACCGCCGCCGCCGCC^TGGAGTCCGAGACCGAACCCGAGCCCGTC^CGCTCCTGGTGAAGAGCCCC 
5 AACCAGCGCCACCGCGACTTGGAGCTGAGTGGCGACCGCGGCTGGAGTC 

TGAGCCGC^TCTACCCCGAGCGTCCGCGTCCAGAGGACCAGAGGTTAATTTATTCT 

GTGAAGAGTCCTTCAAAAATGCCAGAAATCAACGCC^ 

CTAATCGGGGACAGTATCCTGAGGATTCCTCAAGTGATGGTTTAAGGCAAAG 
1 0 'i w iXj\iTJLCCCC TGGATGGGAAAACAT CT O^AGGCCTGAAGCTGCCCAGCAGGCA TTCCAAGGCCTGGGTC CT 
GGTTTCTCCGGTTACACACCCTATGGGTGGCTTCAGCTTTCCTGG 

ACTACATGCAATATTTAGCAGCCACTGCTG(^TCAGGGGCTTTTGTTCCACCACCAAGTC 
ACCTGTGGTCTCTGCACCTGCTCQVGCCCCTATTC 

CAGAATG C TGCTCCT CAAGTGGTTGTTAAT CCTGGAGC CAATCAAAATTTG CGGATGAATGCACAAGGTG 
1 5 GCCCTATTGTGGAAGAAGATGATGAAATAAATCGAGATTGGTTGGATTC 
TTCTGTTTTTCTCAGTATCCTCTACTTCTA 

GTTGTTATGTACCTGCATCACGTTGGGTGGTTTCCATTTAGACCGAGGCCGGTTCAGAACTTCCCAAATG 
ATGGTCCTCCTCCTGACGTTGTAAATCAGGACCCQ^ 

TGAAGACCCCAACCACCTCCCTCCAGACAGGGATGTACTAGATGGCGAGCAGACCAGCCCCT 
20 AGCACAGCATGGCTTGTCTTCAAGACTTTCTTT^ 
ACTGATGGTGTTTGTGCTGTAGCTGTTGGAGGC^ 
GATTGCCTCTCCTGGAC7VTGGCAATGATGAGTTTTTAAA 

GCAAGCAAAAGCAGAAACGTGAAGCCGTGATACAAATTGGTGAACAAAAAATGCCCAAGGCT 

CTTTATTCTGAAGAGCTTTAATATATACTCTATGTAGTTTAATAAGCACTGTACGTAGAAGGCCTTAGGT 
25 GTTGCATGTCTATGCTTGAGGAACTTTTCCAAATGTGTGTGTCTGC 

TAGATGCAGAAGTGGTTCTGCTGGTACGATTTGATTCCT6TTGGM 

ACTTTATATAATCAATGAAATTGCTAGACATGTTT^ 

TGCTTTTTAAAATGC^GTGCTTTACTTTAAACTAAGGGGAACTTTC 
^ TTTCTGTTCAATAAAGTTTTACT^^ 

Human HERPUD1 mRNA sequence - var6 (public giT 12652674) 

GAACTGTCGTTGCAGAGATTGCGGG CGG CTGAGACGCCGCCTGCCTGGCACCTAGGAGCGCAGCGGAGCC 
CCGAC^CCGCCGCCGCCGCCATGGAGTCCGAGACCGAACCCGAGCCCGTC^CX^ 
CAACCAGCGCCACCGCGACTTGGAGCTGAGTGGC<^C^ 
35 CTGAGCCGCGTCTACCCCX2AGCGTCCGCGTCCAGAGGA 

TGGATCACCAATGTCTCAGGGACTTGCTTCCAAAGCAGGAAAAACGGCA 
TGTGAAGAGTCCTTOUVAAATGCCAGAAATC^CGCCAAGGTGGC^ 

TCTAATCGGGGACAGTATCCTGAGGATTCCTCAAGTGATGGTTTAAGGCAAAGGGAAGTTCTTCGGAACC 
TT TCT TCCCCTGGATGGGAAAACATCTCAAGGCCTGAAGCTGCCCAGC^ 
40 TGGTTTCTCCGWTTACACACCCTATGGGT^ CAGATATATGCACGACAG 
TACTACATGCAATATTTAGCaGCCACrGCTGCATCAGGGGCTTT^ 

TACCTGTGGTCTCTGCACCTGCTCCAGCCCCTATTCACAACCAGTTTCCAGCTGAAAACCAGCCTGCCAA 
TCAGAATGCTGCTCCTCAAGTGGTTGTTAATCCTGGAGC 
GGCCCTATTGTGGAAGAAGATGATGAAATAAATCGAGATTGGTTGGATTGGA 
45 TTTCTGTTTTTCTCAGTATCCTCTACTTCrACrCCTCCCT 
CHSTTGTTATGTACCTGCATCACXSTTGGGTGGTT^ 

GATGGTCCTCCTCCTX^CGTTGTAAATCAGGACCCCAACAATAACTTA 

CTGAAGACCCCAACCACCTCCCTCCAGACAGGGATGTACIA^ 

GAGCACAGCATGGCTTGTCTTCAAGACTTTCT 
50 AACTGATGGTGTTTGTGCTGTAGCTGTTGGAGGCTTTGACAGGAATG<^ 

AGATTGCCTCTCCTGGACATGG CAATGATG AGTTTTTAAAAAACAGTGTGGATGATGATATGCTT TTGTG 

AGCAAGCAAAAGCAGAAACGTGAAGCCGTGATACAAATTGG 

TCTTTATTCTGAAGAGCTTTAATATATACTCTATGTAGTTTA 

TGTTGCATGTCTATGCTTGAGGAACTTTTCCAAATGTC 
5 5 ATAGATGCAGAAGTGGTTCTGCTGGTACGATTT^ 

TACTTTATATAATCAATGAA ATTG CTAGACATCTTTTAGCAGGACTTTTCT 

TTGCTT TTTAAAATGCAGTGCTTTA CT TTAAACTAAGGGGAACTTTGCG GAGGTGAAAACCTTTGCTGGG 
TTTTCTGTTCMTAAAGTTTTACTATGAATGAAAAAAAAAA 

60 Human HERPUD1 mRNA sequence - var7 (public gi: 97H684) 

AGAGACGTGAAOTGTCGTTGCAGAGATTGCGGGCGGCTGAGACG CCGCCTGC CACCTAGGAGCGCA 

GCGGAGCCCCGACACXX3CCX3CCGCCGCGATGGACT 

AAGAGCCCCMCCAGCGCCACCGCGACTTGGAGCTGAG 

AGGCCCACCTGAGCCGCGTCTACCCCGAGCGTCCGCGTCCAGAGGACCAGAGGTTAAT^ 
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GCTGTTGTTGGATCZACCAATGTCTCAGGGACTTGCTTCCAAAGCAGGAAAAACGGC^ 
GTGTGCAATGTGAAGAGTCCTTCAAAAATGCCAGAAATCAACGCCIAAGGTGGCT 

CTGCTGGTTCTAATCGGGGACAGTATCCTGAGGATTCCTCAAGTGATGGXTTAAGGCAAAGGGAAGTTCT 
^ TCGGAACCTTTCXTCCCCTGGATGGGAAAACAT 

CACGACAGTACT ACATGCAATA TTTAGCAG CCA CTG CTGCATCAGGGG CTTTTGTTCCAC CACCAAGTGC 

ACAAGAGATACCTGTGGTCTCTGCACCTGCTCCAGCCCCTATTCACAACCAGTTTCCAGCTGAAAA 

CCTGCCAATCAGAATGCTGCTCCTCAAGTGGTTGTTAATC 

CACAAGGTGGCC CTATTGTGGAAGAAGATGATGAAATAAAT CGAGATTGGTTGGATTGGACCTAT TCAGC 
1 0 AGCTACATTTTCTGTTTTTCTCAGTATCCTCTACn^CTACT CCTCCCTGAG CAGATTCCTCATGGTCATG 

GGGGCCACCGTTGTTATGTACCTG(^TCACGTTGGGTGGTTTCCATTTAGACCGAGG 

TCCCAAATGATGGTCCTCCTCCTGACGTTGTAAATCAGG& 

TCCTGAAACTGAAGACCCCAACCACCTCCCTCCAGACAGGGATGT^ 

TCCTTTATGAGCACAGCATGGCTTGTCITCAAGA 
1 5 CCATCGCAAACTGATGGTGTTTGTGCTGTAGCTGTTGGAGG 

CTCCAGCTAGATTGCCTCTCCTGGACATGGCAATGATGAGrT^ 

C TTTTGTGAGCAAGCAAAAGCAGAAACGTGAAGC CGTGATACAAATTGGTGAACAAAAAATG CCCAAGG C 
TTCTCATGTCTTTATTCTGAAGAGCTTTAATATATACTCTATGTAGTTTAATAAGCACT 
G CCTTAGGTGTTGCATGTCTATGCTTGAGGAACTTTT CCAAATGTGTGTGTCTG CATGTGTGTTTGTACA 
20 TAGAAGTCATAGATGCAGAAGTGGTTCTG CTGGTACGATTTGATT CCTGTTGGAATGT TT 
AAGTGTACTACTTTATATAATCAATGAAATTGCTAGAC^TGTTTT^ 

ATGTAT AATTGC iTlT'i'AAAATGCACTGCTTTACTTTAAAC TAAGGGGAAl; 1 11 GCGGAGGTGAAAACCT 
TTGCTGGGTTTTCTGTTCAATAAAGTTTTACTATGAATGACCCTG 

25 Human HERPUD1 m RN A sequence - var8 (public gi: 3005718) 

GACGTGAACGGTCGTTGCAGAGATTGCGGGCXX3 

GAGCCCC^CACCGCCGCCGCCGCCATGGAGTCCGAGACCGAACCCGAGCXICGTCaCGCTCCTGGTGA^ 
AGCCCCAACCAGCGC^CCGOSACTTGGAGCTGAGTGGCGACTO 

CCCACCTGAGCCGCGTCTACCCCGAGCGTCOGCGTCCAGAGC^CCAGAGGTTAATTTATTCTGGGAAGCT 
30 GTTGTTGGATCACCAATGTCTCAGGGACI^GCTTCCA^ 
TGCAATGTGAAGAGTCCTTCAAAAATGCCAGAAATCA^ 
CTGGTTCTAATCGGGGACAGTATCCTGAGGATTCCT 

GAACCTTTCTTCCCCTGGAT GGGAAAACATCTGAAGG CCHXIAAGCTGCCCAGCAGGC^ 
GGTCCTGGTTTCTCCGGTTACACACCCTATGGGTGGCTTC^ 
3 5 GACAGTACTACATGCAATATTTAGC^GCCACTGCTGCATCA 

AGAGATACCTGTGGTCTCTG(^CCTGCTCC^GCCCCTATTCAC^CCAGTTTCCAGCTGAAAACC^GCCT 

GCCAATCAGAATGCTGCTCCTCAAGTGGTTGTTAATCCTGGAGCCAATCAA 

AAGGTGGCCCTATTGTGGAAGAAGATGATGAAATAAATCGAGATTGGTTGGATT 

CAAATGATGGTCCTCCTCCTGACGTTGTAAATCAGGACC^ 

TGAAACTGAAGACXTCCAACCACCTCCCTCCAGACAGGGATGT^ 

TTTATGAGCACAGCATGGCTTGTCTTCAAGACTTTCTTTGCCTCTC^ 

45 CAGCTAGATTGCCTCTCCTGGACATGGCAATGATGAGTTTTrAAAA 

TTGTGAGCAAGCAAAAGC^GAAACGTGAAGCC^TGATACAAATT 

TCATGTCn^ATTCTGAAGAGCTTTAATATATACTCTATGTAGT^ 

TTAGGTGTTGCATGTCTATGCTTGAGGAACTTTTCCAAATGTGTGTGTCTGCATGTGTC 

AAGTCATAGATGCAGAAGTGGTTCTGCTGGTACmTTTGATTCCTC 
50 TGTACTACTTTATATAATCAATGAAATTGCTAGACATGTTTTAGCAGGACTTTTCT 

TATAATTG CTTTTTAAAATGCAGTG CTTTACTTTAAACTAAGGG GAACT TTG CGGAGGTGAAAACCTTTG 



Human HERPUD1 mRNA sequence - var9 (public gi: 285960) 

5 5 CGTGAACGGTCGTTGCAGAGATTGCGGG CGGCTGAGACGCCGCCTG CCTGGCACCTAGGAGCGCAGCGGA 
GCCCCGACACCGCCGCCGCCGCCATGGAGTCCGAGACCGAACCCGAGCCCGTCACGCT 
CCCC^CX^GOSCC^CCGCGACTTGGAGCTGAGTGGCXIACXX^ 
CACCTGAGCCGCGTCTACCCCGAGCGTCCGCGTCCAGAGGACCAGAGGXTAATT^ 
TGTTGGATCACCAATGTCTCAGGGACTTGCTTCCAAAGjCAGGAAAA 

60 CAATGTGAAGAGTCCTTCAAAAATGCCAGAAATCAACxfc 
GGTTCTAATCGGGGACAGTATCCTCAGGATTCCTC^^ 
ACCTTTCTTCCCCrGGATGGGAAAACATCTCAAGGCCTGAA 
T CCTGGTTTCTCCGGTTA(^CACCCT ATGGGTGGCT TCAG CTTTCCTGGTTC(^ 
CAGTACTACATGCAATATTTAGCAGCCACTGCTGCATCAGGGGCT^ 



-133- 



CAATCAGAATGCTGCTCCTQUIGTGGTTGTTAATCCTGGAGCCAATCAAAATTTGCGGATGAATG 
GGraCCCTATTGTGGAA(^GATGATGAAATAAAT^ 

^TTTTCTGTTTTTCTCAGTATCCTCTACTTCTACTCCTCCCTGAGCAGATTCCTCATGGT 
AATC^TGGTCCTCCTCCTGACGTTGTAAAT^^ 

AAACTGAAGACCCCAACCACCTCCCTCCAGACAGGGATGTACTAGATGGCGAGCAGACCAGC 
TATGAGCACAGCATGGCTTGTCTTCAAGACTTTCTTTGCCTCTCTTCTTCCAQAAGGCCCCCCAGCCftTe 

G ^™^ G 7™ TGCTGTAGCTCT ^ 

GCTOGATTGCCTCTCCnX^CATCMCAATGATGAGTTTTTAAAAAACAGTGTGGAl^ 
GTGAGCAAGCAAAAGCAGAAACGTGAAGCCGTGATACAAATTGGTC^CAAAAAATGCCCAAGGCTOCT 

AGGTGTTGCATGTCTATGCTTGAGGAACTTTTCCAAATGTGTGTGTCTGCATGTGTGTTTGTACAT^^ 
GTCATAGATGCAGAAGTGGTTCTGCTGGTAAGATTTGATTCCTGTTGGAATGTTTAAATTACACTAAGTG 
™™™ ATAATCA ^^ 

* AA Z3 G , c IT mAAAATCCAGTG ^ 

GGGTTTTCTGTTCAATAAAGTTTTACTATGAATGACCCTG 

Human HERPUD1 mRNA sequence - varl 0 (public gi: 7661869) 

ZU G^^TGAACGGTCX3TTGCAGAGATTGCGGGCGGCTGAGACGCCGCCTGCCTGGCACCTAGGAGCGCAGCG 
G ^™ CGACACCGCCGCCGCCGCCAT&SAGTCCGAGACC ™^ 

AGCCCCAACCAGCGCCACCGCGACTTGGAGCTGAGTGGCGACCGCGGCTGGAGTGTGGGCCACCTCAAGG 
CCCACCTGAGCCGCGTCXACCCCGAGCGTCCGCGTCCAGAGGACCAGAGGTTAATTTATTCTGGGAAGCT 
GTTGTTGGATCACCAATGTCTri^GGGACTTGCTTCCAAAGCAGGAAAAACGGCATGTTTTGCATCTGGTG 
* G J^?^ GAGTCCTTCAAA ^^ 

C T GG JI^ A J^ CGGGGACA ^^^ 

GAACCTTTC^ITCCCCTGGATGGGAAAACATCTCAAGGCCTGAAGCTGCCCAGCAGGCATTCCAAGGCCTG 
GGTCCTGGTTTCTCCGGTTACAC^CCCTATGGGTGGCTTCAGCTTTCCTGGTTCCAGCAGAra 
GACAGTACTACATGCAATATTTAGCAGCCACTGCTGCATGAGGGGCTTTTriTTr'f'aor'ar'nA 7v r--r-/--<~ 



10 



15 



25 



30 



AAGG ?^ CCTATrG '^^ 

TACATTTTCTGTTTTTCTCAGTATCCTCTACTTCTACTCCTCCCrGAGCAGACT 

GCCACCGTTGTTATGTACCTGCATCACG1TGGGTGGTTTCCATTTAGACCGAGGCCGGTTCAGAACTTC 

TGAAACTGAAGACCCCAACCACCTCCCTCCAGACAGGGATGTACTAGATGGCGAGCAGACCAGCCCCTCC 
TTTATGAGCACAGCATGGCTTGTCTTCAAGACTTTCTTTGCCTCTCTTCTTCCAGAAGGCCCCCCAGCCA 

tcg^ctgatggtgtttgtgct^^ 

CAGCTAGATTGCCTCTCCTGGACATGGCAATGATGAGTTTTTAAAAAACAGTGTGGATGATGATATGCTT 
^ G ^™ GC ^ GCAGAAACGTGAAGCCG ^ TO 

^^ C ir rAOTC1 ^ GAG< ^ AATATAraC ^ A TOTAGTTTAATAAG^ 
TIAGGTGTTCCATGTCTATGCTTGAGGAACTT^ 

AAG J CA * AGA I GCAG ^^ 

^ AC Z A 5^I^ TAATCAATGAAA " GCTAGACATG1 ^ 
45 I A ^^I^ AAAA! ^ G TCCTrDVCTTTAAACTAAG<MGAACT^ 

CTGGGTTTTCTGTTCAATAAAGTTTTACTATGAATGACCCTGAAAAAAAAAAAAAAAAAAAAAA 
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Human HLRPUD1 Protein sequence - varl (public gi: 16S07802) 

?^^^^ VC ™^ PS ™ PE11 ^ AES ^PAGSTOG^ 
AWuVFKXF FASLLPEGPPAXAN 

Human HERPUD1 Protein sequence - var2 (public gi: 10441911) 

*^ n D J?^^ D ^ SAATP ^ 
PPPDVVNQDPNNNLQEGTOPETEDPNHLPPDRDVLDGEQTSPSFMSTAWLVFKTFFASLI.PEGPPAIAN 

Human HERPUD1 Protein sequence - var3 (public gi: 3005723) 

EEPAGSMRGOYPEDSSSDGl.Rrn>inn.DMi.cenr^.™.,->-«^ 



50 
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PPAIAN 

Human HERPUD1 Protein sequence - var4 (public gi: 7661870) 

MESETEPEPVTLLVKSPNQRHMLEIiSGDRGWSVGHL 
5 DLLPKQEKRH\njHLVCNVKS PS KMPE INAKVA^ SPGWE 
NISRPEAAQQAFQGLGPGFSGYTPYGWLQLSWFQQIYARQYYMQYLAATAASGAFVPPPSAQEIPWSAP 
APAPIHNQFPAENQPANQNAAPQVVVNPGANQNLRIW^ 

LYF YSSLSR FLMVMGATVVMYIiHHVGWFPFRPRPVQNF PNDG P PPDVVNQDPNNNLQEGTD PETEDPNHL 
P PDRDVLD GEQTSP S FMS TAWLVFKTFFAS LIjP EGP PAIAN 

10 



Rat HERPUD1 mRNA sequence (public gi: i67S896i) 

AAGACACCAAGTGTCGTTGTGGGGTCGCAGACGGCTGCGTCGCCGCCCGTTCGGCATCCCTGAGCGCAGT 
CGAGCCTCCAGCGCCGCAGAC^TGGAGCCCGAGCCACAGCCCGAGCCGGTCACGCTGCTGGTGAAGAGCC 
15 CCAATCAGCGCCACCGCGACTTGGAGCTGAGTGGCGACCGCGGTTGGAGTGTGAGTCGCCTC^GGCCCA 
CCTGAGCCGAGTCTACCCCGAACGCCCGCGCCCAGAGGACCAGAGGTTAATTTATTCTGGGAAGCTGCTG 
TTGGATCACCAATGTCTCCAAGACTTGCTTCCAAAGCAGG 
ATGTGAGGAGTCCCTCAAAAAAGCCAGAAGCCAGCACAAAGGGTGCTGAGTC 

CACTAGTCAGGCACAGTATCCTGGGGATTCCTCAAGCGATGGCTTACGGGAAAGGGAAGTCCTTCGGAAC 
20 CTTCCTCCCTCTGGATGGGAGAACGTCTCTAGGCCTGAAGCCGTCCAGCAGACTTTCCAAGGCCTCGGGC 
CCGGCTTCTCTGGCTACACCACCTACGGGTGGCTGCAGCTCTCCTGGTTCC^ 
GTACTACATGCAATACTTGGCTGCCACTGCTG 

ATACCTGTGGTCTCTACACCGGCTCCCGCCCCTATACACAACCAGTTTCCGGCAGAAAACCAGCCGGCCA 

ATCAGAATGCAGCCGCTCAAGCGGTTGTTAATC 
25 CGGCCCTCTGGTGGAAGAAGATGATGAGATAAACCGAGACTGGTTGGATTGGACCTACTC^GC^GCGA^ 

TTTTCCGTTTTCCTCAGCATTCTTTACTTCTACrCCTCCCTGAGCAGATT 

COTTAGTTCATGTACCTGCACCACGTCGGGTGGTTTCCATTCAGACAGAGGCCAGTTCAGAACOT 

TGACGGTCCCCCTCAGGAAGCn^CCAACC^GGACCCCAACAATAACCT 

ATGGAAGACCCCAACCGCCTC£Ca3TAGGCCGTGAAG^^ 
30 TGAGCAC&GCATGGCTAGTCrTCAAGACTT'T 

AAACTGATGGCCCCTGTGCTCTGTTGCTGGAGGCTTTC^ 

CTCGAGAGAGTCATTGAAAACCC^CAGGATGACGATGTGCTTC 

CATGAAGCCGTGGTAC^VAACTGAACAGGGC CCCTCAT.'GTCGTTATT CTGAAGAGCTTTAATGTATACTGT 
ATGTAGTCTCATAGGCACTGTAAACAGAAGGCCCAGGGTCGCATGTTCTGCCT 
35 TGTGTGCATGTGTGCCGTACATGGAAGTCATAGACGTGTGTGCATGTGTGCTCTACATGGAAGTCATAGA 
TGCAGAAACGGTTCTGCTGGTTCGATTTGATTCCTGTTGGAATGTTGCAATTACA 
TATATAATCAGTGACTTGCTAGACATGTTAGCAGGACTTTTCTAGGAG^ 

TTAAAACGCAGTGCTTACTTACTGAGGGCGGCGACTTGGCACAGGTAAAGCCTTTGCCGGGTTTTCTGTT 
CAATAAAGTTTTGCTATGAACGACAAAAAAAAAAAAA 

40 

Rat HERPUD1 Protein sequence (public gi: 16758962) 

MEPEPQPEPVTLLVKSPNQRHRDIiELSGDRGWSVSRLKAHLSRW 

dllpkqekrhvlhlvcnvrs pskkpeastrgaesteqpdntsqaqypgdsssdglrerevlrnlp PS gwe 

WSRPEAVQ(^FQGIiGPGFSGYTTYGWLQIjSWF(^I YARQY YMQYIiAATAASGAFGPTPSAQEI PVVSTP 
45 APA P I HNQF PAENQ PANQNAAAQAVV1^GANQNI«RMNAQGGPIj VEEDDE I NRDWLDWT YS AATFS VFLS I 
LYFYSSLSRFLMVMGATVVMYIiHHVGWFPFRQRPVQNFPDTC 
PVGREVLDPEHTSPSFMSTAWLVFKTFFASLLPEGPPAIiAN 



50 Mouse HERPUD1 mRNA sequence (public gi: 11612514) 

AAAGACGCCAAGTGTCGTTGTGTGGTCTCAGACGGCT 
TCGAGCCGCCAGCGACGCAGACATGGAdCCCGAGCCACAGC 

CCCAATCAGCGCC^CCGCGACTTCGAGCTGAGTGGCGACCGCAGTTGGAGTGTGAGTCGCCT 
ACCTGAG CCGAGTCTACCCCGAGCGCCCGCX5TCCAGAGGAC CAGAGGTTAATTTATTCTGGGAAGCTG CT 
55 GTTGGATCACCAGTGTCTCCAAGATTTGCTTCCAAAGCAGGAAAAGCGACATGT 
AATGTGAAGAATCCCTCCAAAATGCCAGAAACC^GCACAAAGGGT^ 

ACTCTAATCAGACACAGCATCCTGGGGACTCCTCAAGTGATGGTTTACGGCAAAGAGAAGOT 
CCTTTCTCC CTCCGGATGGGAGAACATC TCTAGGCCTGAGG CTGTC CAGCAGACTTTCCAAGG CCTGGGG 
^ CCTGGCTTCTCTGGCTACACAACGTATGGGTGGCTGCAGCTCT 

GATACCTGTGGTCTCTACACCTGCTCCGGCTCCTATACACAACC^ 

AATCAGAATGCAGCTG C TCAAG CGGTTGTCAATCCCGGAGCCAATCAGAACTTG CGGATGAATG CACAAG 
GTGGCCCCCTGGTGGAGGAAGATGATGAGATAAACCGAGACTGG 
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GTTTTCTGTTTTCCTCAGCATCCrTTACTTCTACTCCT 

ACTGTAGTCATGTACCTGCACCACGTCGGGTGGTTTCCGTTCAGACA.GAGGCCAGT^ 
ATGATGGTGGTCCTCGAGATGCTGCCAACCAGGACCCCAACAATAACCTCCA^ 

AATGGAAGACCCCAACCGCCTCCCCCCAGACCGCGAAGTGCTGGACCCTGAGCACACCAGCCCCTCGTTT 
5 ATGAGCACAGCATGGCTAGTCTTCAAGACTTTCTTTG 

CQVACTGATGGCCCTTGTGCTCTGTCGCTGGTGGCTTTGACAGCTCGGACTGGA 
CCTTTTCCTCCCCTGGCGTGGACTCGACAGAGTC^^ 

AAGCAAAAG CACAAACTAAGACATGAAG CCGTGGTA CAAACTGAACAGGGCCCCTCATGT CGTTATTCTG 
AAGAGCTTTAATGTATACTGTATGTAGTTTCATAGGCACTGT 
10 GCCTGAGCACCTCCCCAGATGTGTGTGC^TGTGTGCTGTAC7VTG<3AAGTCATAGACGTGTGTGCATGTGT 
GCTCTACATGGAAGTCATAGATGCAGAAACGGTTCTGCTGGTTCGATTTGATTCCTO 
ATTAC^CTAAGTGTACTACTTTATATAATCAGTGAATTGCTAGACATGTTAGCAGGACTTTTCTAGGAGA 
GACTTATGTATAATTGCTTTTTAAAATGCA^^ 

AAAACCTTTGCCGAGTTTTCTGTTCAATAAAGTTTTGCTATGAATGACTGT 

15 

Mouse HERPUD1 Protein sequence (public gi: 11612515) 

MEPEPQPEPVTLLVKSPNQRHRDLELSGDRSWSVSRIiKAHL^ 
DLLPKQEKRHVLHLVCNVKNPSKMPETSTKGAESTEQPD^ 

NISRPEAVQQTFQGLGPGPSGYTTYGWLQLSWFQQIYARQYYMQYLAATAASGTFVPTPSAQEIPWSTP 
20 APAPIHNQFPAENQPANQNAAAQAVWPGANQNLRMNAQGGPLVEEDDEINRDWIiDWTYSAATPSVF 
I*YF YS SLSRFLMVMGATVVMYLHHVGWFPFRQRP VQNF PDDGGPOT 
PPDREVU3PEHTSPSFMSTAWLVFKTFFASI*liPEGPPALAN 



25 INCORPORATION BY REFERENCE 

All publications and patents mentioned herein are. hereby incorporated by 
reference in their entirety as if each individual publication or patent was specifically 
and individually indicated to be incorporated by reference. In case of conflict, the 
present application, including any definitions herein, will control. 

30 

EQUIVALENTS 

While specific embodiments of the subject applications have been discussed, 
the above specification is illustrative and not restrictive. Many variations of the 
applications will become apparent to those skilled in the art upon review of this 
35 specification and the claims below. The full scope of the applications should be 
determined by reference to the claims, along with their -full scope of equivalents, and 
the specification, along with such variations. 
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What Is Claimed; 

1 . An isolated, purified or recombinant complex comprising a POSH polypeptide 
and a POSH-AP. 

5 2. The complex of claim 1, wherein the POSH-AP comprises a polypeptide selected 
from the group consisting of: selected from the group consisting of: an UNC84, an 
MSTP28 and an HERPUD1 . 

3. An isolated, purified or recombinant complex comprising 
10 a) a polypeptide comprising a domain that is at least 90% identical to a 

POSH SH3 domain; and 

b) a POSH-AP comprising a polypeptide selected from the group consisting 
of: an UNC84, an MSTP28 and an HERPUD1 . 

15 4. A method of identifying an antiviral or anti-apoptotic agent, the method 

comprising identifying a test agent that disrupts a complex of any of claims 1-3. 

4A. A method of identifying an agent to treat a neurological disorder, the method 
comprising identifying a test agent that disrupts a complex of any of claims 1-3. 

20 

4B. The method of claim 4A, wherein the neurological disorder is a CNS disorder. 

4C. The method of claim 4B, wherein the CNS disorder is selected from the group 
consisting of: Alzheimer's disease, cerebral vascular disease and schizophrenia. 

25 

5. A method for identifying an antiviral or antiapoptotic agent comprising: 

a) providing a POSH-AP polypeptide and a test agent; and 

b) identifying a test agent that binds to the POSH-AP polypeptide. 

30 5 A. A method for identifying an agent to treat a neurological disorder, comprising: 

a) providing a POSH-AP polypeptide and a test agent; and 

b) identifying a test agent that binds to the POSH-AP polypeptide. 
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5B. The method of claim 5 A, wherein the neurological disorder is a CNS disorder. 

5C. The method of claim 5B, wherein the CNS disorder is selected from the group 
5 consisting of: Alzheimer's disease, cerebral vascular disease and schizophrenia. 

6. The method of claim 5 or 5A, wherein the POSH-AP is selected from the group 
consisting of: an UNC84, an MSTP28 and an HERPUD1. 

10 7. A method of identifying an agent with anti-apoptotic or anti-viral activity, the 
method comprising: 

a) contacting a POSH-AP polypeptide with a test agent, and 

b) identifying a test agent that modulates a POSH-AP activity. 

15 7A. A method of identifying an agent with. activity against the progression of a 
neurological disorder, the method comprising: 

a) contacting a POSH-AP polypeptide with a test agent, and 

b) identifying a test agent that modulates a POSH-AP activity. 

20 7B. The method of claim 7A, wherein the neurological disorder is a CNS disorder. 

7C. The method of claim 7B, wherein the CNS disorder is selected from the group 
consisting of: Alzheimer's disease, cerebral vascular disease and schizophrenia. 

25 8. The method of claim 7 or 7A, wherein the POSH-AP is selected from the group 
consisting of: an UNC84, an MSTP28 and an HERPUDL 



9. A method for identifying an antiviral or anti-apoptotic agent comprising: 
providing a POSH-AP and a test agent; and 
30 identifying a test agent that interacts with the POSH-AP. 

9A. A method for identifying an agent to treat a neurological disorder, comprising: 
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providing a POSH-AP and a test agent; and 
identifying a test agent that interacts with the POSH-AP. 

9B. The method of claim 9A, wherein the neurological disorder is a CNS disorder. 

5 

9C. The method of claim 9B, wherein the CNS disorder is selected from the group 
consisting of: Alzheimer's disease, cerebral vascular disease and schizophrenia. 

10. The method of claim 9 or 9A, wherein the POSH-AP is selected from the group 
10 consisting of: an UNC84, an MSTP28 and an HERPUD1 . 

1 1 . An isolated antibody, or fragment thereof, specifically immunoreactive with an 
epitope of a sequence selected from the group consisting of SEQ ID NO: 2, SEQ ID 
No: 26, SEQ ID NO:27, SEQ ID NO:28, SEQ ID NO:29, and SEQ ID NO:30, 

1 5 which antibody disrupts the interaction between a polypeptide of SEQ ID NO: 2 and 
a POSH-AP. 

12. A method of inhibiting viral infection comprising administering an agent to a 
subject in need thereof wherein said agent inhibits the interaction between a POSH 

20 polypeptide and a POSH-AP. 

12 A. A method of inhibiting the progression of a neurological disorder, comprising 
administering an agent to a subject in need thereof wherein said agent inhibits the 
interaction between a POSH polypeptide and a POSH-AP. 

25 

12B. The method of claim 12A, wherein the neurological disorder is a CNS 
disorder. 

12C. The method of claim 12B, wherein the CNS disorder is selected from the 
30 group consisting of: Alzheimer's disease, cerebral vascular disease and 
schizophrenia. 
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13. The method of claim 12 or 12A, wherein the POSH-AP is selected from the 
group consisting of: an UNC84, an MSTP28 and an HERPUD1. 
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ABSTRACT 

The application discloses novel polypeptides and nucleic acids involved in a 
variety of biological processes, including viral reproduction. Related methods and 
compositions are also described. 
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Figure 1 : Human POSH Coding Sequence (SEQ ID NO:l) 

ATGGATGAAT^GCCTTGTTGGATCTTTTGGAGTGTCCGGTGTGTCT 
AGGTCTTGCCTTGCCAGCATACGTTTTGCAAGC^ 

CAGATGTCCCGAGTGCAGGACTCTTGTTGGCTCGGGTGTCGAGGAGCTTCCCAGTAACATCTTGCTGGTC 
5 AGACTTCTGGATGGCATOVAACAGAGGCCTTGGAAACCT^ 

CAAATG CATTAAGGTCTCAGAGCAGCACTGTGGCTAATTGTAGCT CAAAAGATCTGCAGAGCTCC CAGGG 

CGGACAGCAGCCTCGGGTGCAATCCTGGAGCCCCCCAGTGAGGGGTATACC^ 

GCGTTATACAACTATGAAGGAAAAGAGCCTGGAGACCTTAAATTCAGCAAA^ 

GAAGACAAGTGGATGAAAATTGGTAC(^TGGGGAAGTCAATGGAATCCATC 
1 0 TGTGCAGATTATTAAACCXJTTACCTC^GCCCCCACCTCZAGTGCAAAGCA 

GACAAGGAAGCAGACAAAGATTGCCTTCCATTTGCAAAGGATGATGTTCTGAC 

ATGAAAACTGGGCTGAAGGAATGCTGGCAGACAAAATAGGAATATTTCCAATTTCA 

CTCGGCTGCTAAGCAGCTGATAGAATGGGATAAGCCTCCTGTGCCAGGAGTTGATGCT^ 

TCGGCAGCAGCCCAGAGCAGCACTGCCCCAAAGCACTCCG^ 
1 5 CCTTC^CTTCCCTCACTATGGCCAACT^GTC^ 

CCCCCCTGTCCT(^TCAGCTCCAGC^UVCCCCACTGCTGCTGCACGGATCAGCGAGCTGTCTGGGCTCT 

TGCAGTGCCCCTTCTCAGGTTCATATAAGTACCACCGGGTTAATTGTGACCCCGCCCCO^GCAGCCCAG 

TGACAACTGGCCCCTC^TTTACTTTCCCATCAGATGTTCCCTACCAAGCT 

TCCTCTTCCACCACCCCCTCTCCTGGCTGCCACTGTCCTTGCCTCCACACCACCA 
20 GCTGC^GCrTGCTGGAATGGGACCX^GGCCCATGGCAGGATCCACTGA 

AGACTCGCCCCAGTGTGTATGTTGCTATATATCCATACACTCCT 

AAAAGGGGAGATGTTTTTAGTGTTTGAGCGCTX3CCAGGATGGCTGGTO * 
AGCAAGATAGGGGTTTTCCCTGGCAATTATGTGGCACCAGTCA 
^ CTAAAGTCCCTATGTCTACAGCTGGCCAGACAAGTCGGGGAGTGACCATGGTCAG^ 

TCAGCAGCTCACATCCAGACAAGTCCTCAGGCTAAGGTCTTGTTC 
ACCAGGCCCGCAATGCTGTGAGGACAGTTGCAGCGCACAACC^ 

CATCCAGGTACAGAATGCCGCCGGCCTCAGCCCTGCATCTGTGGGCCTGTCCCATCACT 
CCACAACCTGCGCCTCTGATGCCAGGCTCAGCCACGCACACTGCTGC^ 
30 CCCCTCTGGCCTGTGCAGCAGCTGCTCCACnt^CTrCCCCAAGCATCA 

GCCCAGTGGCCGGATAGTGACCGTTCTCCCTGGACTCCCCACATCTCCTGACAGTGCTTCAT^ 
GGGAACAGTTCAGCAACCAAAC CAGACAAGGATAGCAAAAAAGAAAAAAAGGGTTTGTTGAAGTTGCT^ 
CTGGCGCCTCCACTAAACGGAAGCCCCGCGTGTCTCCT 

CAGTGCAGAGCTTCCTCTCC^GGGAGCGGTGGGGCCCGAACTGCCACCAGGAGGTGGCCATC 
35 GGCTCCTGCCCTGTGGACGGGGACGGACCGGTCACGACTGCAGT^ 
CTTTTCATAGGAAGGQLAGTTCCCTGGACTCC^ 
CTCCCTGGGTCCTGTCTTGAATGAGTCTAGACCT 

ATGGCTGGTTCAAAGG CACATTACAACGTAATG GGAAAACTGGC CTTTT 
40 CATATGA 



Figure 2: Human POSH Amino Acid Sequence (SEQ ID NO:2) 



MDESAIJiDLLECPVCIiERLDASAKV^PCQHTFC 
RIJ^GIKQRPWKPGPGGGSGTOCmAIJtSQSSTVANC^^ 

ALYNYEGKEPGDLKFSKGDI I ILRRQVDENWYHGEVNGIHGFFPTNFVQI IKPLPQPPPQCKALYDFEVK 
DKEADKD CL PFAKDDVLiTVT RRVDENWAEGMLADKX GX FP I S YVEFNSAAKQLI EWDKPPVPGVDAGECS 
S AAAQS STAPKHSDTKKNTKKRHS FTSLTMANKSSQASQNRHSME ISP PVLI SS SNPTAAAR I S ELSGLS 
CSAPSQVHISTTGLIWPPPSSPVTTGPSFTFPSDVPYQAAIiGTI^PPL^ 
AAAAGMGPRPMAGSTDQIAHLRPQTRPSVYVAIYPYTPRKEDELEIJUCGEMFL^ 

SKIGVF PGN YVAFVTRAVTNAS QAKVPMSTAGQTS RGVTMVS P S TAGGP AQKLQGNG VAGS P SWPAAW 
S AAH I QTSPQAKVLLHMTGQMTVNQARNAVRTVAAHNQE RPTAAVTP I Q VQNAAGLS P A S VGI*SHHS LAS 
PQPAPLMPGSATHTAAISISRASAPLAOU^PLTSPSITSASLEAEPSGRIVTVLPGLPTSPDSASSAC 
GNSSATKPDKDSKKEKKGLLKLLSGASTKRKPRVSPPASPTLEVELGSAEI*PLQGAVGPELPPGGGHGRA 
GSCPVDGDGPVTTAVAGAALAQDAFHRKAS SIiDSAVP I APP PRQACS SLGP VLNE SR PWCERHRVWS Y 
P PQS EAELELKEGDI VFVHKKREDGWFKGTIiQRNGKTGLFPGSFVENI 



Figure 3: Human POSH cDNA Sequence (SEQ ID NO:3) 

CTGAGAGAC^CTGCGAGCGGCX^GCGC 

CGCGAACAAAGAGGAGGAG C CGAGG CGCGAGAGCAAAGTCTGAAATGGATGT TACATGAGTCATTTTAAG 
5 GGATGCACACAACTATGAACATTTCTGAAGATTTTTTCTCAGT^^ 

CTTG TTGGATCTTTTGGAGTGT CCGGTGTGTCTAGAGCGCC TTGATG CT TCTGCGAAGGTCH^GCCTTGC 
CAGCATACGTTTTG CAAGCGATGTTTG CTGGGGATCGTAGGTTCTCGAAATGAACTCAGATGTCCCGAGT 
GCAGGACT CTTGTTGGCTCGGGTGTCGAGGAGCTTCCCAGTAACATCTTGCTGGTCAG AC TT CTGGATGG 

1 0 TCTCAGAGCAGCACTGTGGCTAATTGTAGCT 

GGGTGCAATCCTGGAGCCCCCGAGTGAGGGGTATACCT 

TGAAGGAAAAGAGCCTGGAGACCTTAAATTCAGCAAAGGCX^CATCATCATTTTG 
GAAAATTGGTACCATGGGGAAGTCAATGGAATCCATGGCTTTTTCC 

AACCGTTACCTCAGCCCCCACCTCAGTGCAAAGC^CTTTATGACTTTGAAGTGAAAGACAAGGAAG 
1 5 CAAAGATTGCCTTCCATTTGCAAAGGATGATGTTCrGACTGTGATCCGAAGAGTGGAT^ 
QAAGGAATGCTGGCAGACAAAATAGGAATATTTCCAATT^ 

AGCTGATAGAATGGGATAAGCCTCCTGTGCCAGGAGTTGATGCTGGAGAATGTTCCTCGGCAGCAGCCCA 
GAGCAGCACTGCCCCAAAGC^CTCCGACACCAAGAAGAACACCAAAAAGCGGCACT 
ACTATGGCCAACAAGTCCTCCC^GGCATCC 
20 TCAGCTCCAGCAACCCCACTGCTGCTGCACGGATCAGCGAGCTGTCTGGGCTCTCCTGCAGTG 

TCAGGTTCATATAAGTACCACC^GGTTAATTGTGACCCCGCCCCCAAGCAGCCC^GTGACAACTGGCCCC 

TCGTTTACTTTCCCATCAGATGTTCCCTACCAAGCTGCCCT 

CCCCTCTCCTGGCTGCCACTGTCCTTGCCTCC^C^^ 

AATGGGACCGAGGCCCATGGCAGGATCC^CTGACCAGATTGCACATTTACGGCCGCTVGACTCGCCCCAGT 
25 GTGTATCTTGCTATATATCCATACACTCCTCGGAAAGAGGATGAACTAGAGCTGAGA 

tttt agtgtttgagcgctgc caggatggctggtt caaagggacatccatg cataccagcaagataggggt 

tttccctggcaattatgtggcaccagtcacaagggcggtgacaaatgcttcccaagctaaagtccc 

tctacagctggccagacaagtcggggagtgaccatggtcagtccttc 

agctccagggaaatggcgtggctgggagtcccagtgttgtccc^ 
30 ccagacaagtcctcaggctaaggtcttc 

gctgtgaggacagttgcagcgcacaaccaggaacgccccacggcagcagtgaca 

atgccxscoggcctcagccctgcatctgtgggcct^ 

tctgatgcgaggctcagccacgcacacrgctgccatcagt^ 

gcagcagctgctccactgacttccccaagcatc^ 
35 tagtgaccgttctccctggactccccacatctcx:tgacagtgcttcatcagcttgtgggaacagttcagc 

AACCAAACCAGACAAGGATAGCAAAAAAGAAAAAAAGGGTTTO 
AAACGGAAGCCCOSCXSTGTCTCCTCCAGC^ 

CTCTCC^GGGAGCGGTGGGGCCCGAACTGCCACCAGGAGGTGGCCATGGCAGGGCAGGCTCCTGCCCTGT 

GGACGGGGACGGACCGGTCACGACTGC^GTGGCAGGAGCAGCC^ 
40 GCAAGTTCCCTGGACTCCGCAGTTCCCATCGCTCCACCTC 

TCTTdAATGAGTCTAGACCTGT CGTTTGTGAAAGGCACAGGGTGGTGGTTTCCTATC CTCCTCAGAGTGA 

GGCAGAACTTGAACTTAAAGAAGGAGATATTGTGTTTC 

GGCACATTACSUVCGTAATGGGAAAACTGGCCTTT^ 

GACACTGAAGAAGCTTAAAATCACTTCACACAACAAAGTAGC^ 
45 TTGTGGACTTCCAGATXXSTCAGGAGATG^ 

CCCCAGCGAGCAGAGTGAAGAAGATGTTTGTGTGGGTTTTGTTA^ 

CCrTGTACTGTCTGATTTACTACACAGAGAAACTTTTTTTTTTTTTTA 

ATTGTTTACAAGGCTTAACTAATTTATTTGCTTTT^ 

TTGGATTATGATTTTAAGAAATTATTAATTTATGAAATGATAGGTAAGGAGAAGCT 

ATTTTGGGGTTATGTTTTGCTTCTTTAAGATAGAAATCCCAGTTCTOT CTTTGGGA 
AACCAAACATACAAATGAATCAGTATCAATTAGGGCCTGGGGTAGAGAG 

TGGGTTTTTAATTTTCCTAGAATGAAGTGACTGAAACAA 
55 AATGTATTTAGAAATATATTTAGTTTTATAGCAGAA 

TTGAAGTTGTAGTCACTGTCTGAGAATGGCTATGAAGCGTCATTTCACATTTTA^ 

TGCCCAGGACACAAGTAAAACATTT GTGAGATAGTGGTGGTAAGTGATG CACTCGTGTTAAGTCAAAGGC 
TATAAGAAACACTGTGAAAAGTTCATATTCATCCAT^ 

GGATTCCCAO^GTAATATAGACTGTGCATGGTGTGTATATTTCATTGCGATTTCCTGTTAA 
60 GTACTCAGAArrGACCAATTCAGGAGGTGTAAAAATAAACAGTGTTCTCrrCTCTA 

CTGAC CAAGGTCTCTTCAGTGCACTCGCTCCCTCTC TGGCTAAGGCATG CATTAGCCACTACACAAGTCA 

TTAGTGAAAGTGGTCTTTTATGTCCTCCCAGCAGAGAGA^ 

CTGTGACTGTGGAGCTCTGGAAGGCTTGGTGGGAGTGAAT^ 

CAGAAGAGCCTGTCTTTTTATATCCATTCCTTGA 
65 CACGCAGTCATGGATCTGGGTAGrCCGGAAAACAAAAGGA 



TTAC CACAGTTTTCTCATGGGAAAT ACATAATAAACCCTTT CATCTTTTTTT TT TT CCTTTAAGAATT AA 
AACTGGGAAATAGAAACATGAACTGAAAAGTCT^ 

TTT ATATGGTTGAAGATGAAATCATT C CTAAATTAACCTTTTTTTT AAAAAAAAACAATGTATAT TATGT 
TCCTGTGTGTTGAATTTAAAAAAAAAAAATACTTTACTTC 
5 AAAT G AACTTTAGT TAGGAAAAAG CTGGCAT CAG CTTTCATCTGTGTAAGTT GACACCAATGTGTCATAA 
TATTCTTTATTTTGGGAAATTAGTGTATTTTATAAAAA 
QATA ATlTl M l M r rA 

CTGGTATAAAGTTGGTT AAAATTT CATCTGTTAATAGATC^TTAGGTAATATAATGT ATGGG" ±~rrTCTAT 
TGGTTTTTTGCAGA CAGTAGAGGGAGATTT TGTAACAAGGGCTTGTTACACAGTGAT ATGGTAATGAT AA 
1 0 AATTG CAATTTAT CACT CCTTTT CATGTTAATAATTTGAGGACTGGATAAAAGGTTT CAAGATTAAAATT 
TGATGTTCAAACCTTTGT 



Figure 4: 5* cDNA fragment of human POSH (public gi:10432611; SEQ IDNO:4) 

ctgagagacactgcgagcggcgagcgcggtggggccgcatctgcatcagccgccgcagccgctgcggggc 
cgcgaacaaagaggaggagccgaggcgcgagagcaaagtctgaaatggatgttacatgagtcattttaag 
5 gatgcacacaactatgaacatttctgaagattttttctcagtaaagtagataaagatggatgaatcagcc 
ttgttggatcttttggagtgtccggtgtgtctagagcgccttgatgcttctgcgaaggtcttgccttgcc 
agcatacgttttgcaagcgatgtttgctggggatcgtaggttctcgaaatgaactcagatgtcccgagtg 
caggactcttgttggctcgggtgtcgaggagcttcccagtaacatcttgctggtcagacttctggatggc 
atcaaacagaggccttggaaacctggtcctggtgggggaagtgggaccaactgcacaaatgcattaaggt 

10 ctcagagcagcactgtggctaattgtagctcaaaagatctgcagagctcccagggcggacagcagcctcg 
ggtgcaatcctggagccccccagtgaggggtatacctcagttaccatgtgccaaagcgttatacaactat 
gaaggaaaagagcctggagaccttaaattcagcaaaggcgacatcatcattttgcgaagacaagtggatg 
aaaattggtaccatggggaagtcaatggaatccatggctttttccccaccaactttgtgcagattattaa 
accgttacctcagcccccacctcagtgcaaagcactttatgactttgaagtgaaagacaaggaagcagac 

15 aaagattgccttccatttgcaaaggatgatgttctgactgtgatccgaagagtggatgaaaactgggctg 
aaggaatgctggcagacaaaataggaatatttccaatttcatatgttgagtttaactcggctgctaagca 
gctgatagaatgggataagcctcctgtgccaggagttgatgctggagaatgttcctcggcagcagcccag 
agcagcactgccccaaagcactccgacaccaagaagaacaccaaaaagcggcactccttcacttccctca 
ctatggccaacaagtcctcccaggcatcccagaaccgccactccatggagatcagcccccctgtcctcat 

20 cagctccagcaaccccactgctgccgcacggatcagcgagctgtctgggctctcctgcagtgccccttct 
caggttcatataagtaccaccgggttaattgtgaccccgcccccaagcagcccagtgacaactggcccct 
cgtttactttcccatcagatgttccctaccaagctgcccttggaactttgaatcctcctcttccaccacc 
ccctctcctggctgccactgtccttgcctccacaccaccaggcgccaccgccgccgctgctgctgctgga 
atgggaccgaggcccatggcaggatccactgaccagattgcacatttacggccgcagactcgccccagtg 

25 tgtatgttgctatatatccatacactcctcggaaagaggatgaactagagctgagaaaaggggagatgtt 
tttagtgtttgagcgctgccaggatggctggttcaaagggacatccatgcataccagcaagataggggtt 
ttccctggcaattatgtggcaccagtcacaagggcggtgacaaatgcttcccaagctaaagtccctatgt 
ctacagctggccagacaagtcggggagtgaccatggtcagtccttccacggcaggagggcctgcccagaa 
gctccagggaaatggcgtggctgggagteccagtgttgtccccgcagctgtggtatcagcagcncacatc 

30 cagacaagtcctcaggctaaggtcttgttgcacatgacggggcaaatgacagtcaaccaggcccgcaatg 
ctgtgaggacagttgcagcgcacaaccaggaacgccccacggcagcagtgacacccatccaggtacagaa 
tgccgccggcctcagccctgcatctgtgggcctgtcccatcactcgctggcctccccacaacctgcgcct 
ctgatgccaggctcagccacgcacactgctgccatcagtatcagtcgagccagcgcccctctggcctgtg 
cagcagctgctccactgacttccccaagcatcaccagtgcttctctggaggctgagcccagtggccggat: 

35 agtgaccgttctccctggactccccacatctcctgacagtgcttcatcagcttgtgggaacagttcagca 
accaaaccagacaaggatagc 



Figure 5: N terminus protein fragment of hPOSH (public gi:10432612; SEQ ID 
NO:5) 

^ESALLDLLECPVC^ERLDASAKVLPCQHTFCKRCI^ 
5 RLLDGI KQRPWKPGPGGGSGTNCTWALRSQSSTV^CSSKDLQSSQGG^QPRVQSWSPPVRGIPQLPCAK 
ALYNYEGKEPGDLKPSKGDI IIIjRRQVDENWYHGEVNGIHGFFPTNFVQI IKPLPQPPPQCKALYDFEVK 
DKEADKDCXPFAKDDVLTVI RRVDENWAEGMIiADKIGIFPI S YVEFNSAAKQLI EWDKPPVPGVDAGECS 
SAAAQSSTAPKHSDTKKIJTKKRHSFTSLTMANKSSQASQNRHSMEISPPVIjISSSNPTAAARISELSGLS 
CSAPSQVHISTTGLrCTPPPSSPVTTOPSFTFPSDVPYQAAL^^ 
10 AAAAGMGPRPMAGSTDQIJ^RPOTRPSVYVAIYPyTPRKEDELELIaCGEMFLVFERCQDGWFKGTS^^ 
S KIGVF PGN YVA PVTRAVTNASQAKVPMS TAGQTS RGVTMVS P S TAGGPAQKLQGNGVAGS P SW P AA W 
S AAH I QTS P QAKVLLHMTGQMTVNQARNAVRTVAAHNQERPTAA VTP I QVQNAAGLS PASVGLS HH S LA S 
PQPAPLMPGSATHTAAISISRASAPLACAAAAPLTSPSITSASLEAEPSGRIVTVIiPGLPTSPDSASSAC 
GNSSATKPDKDS 



Figure 6: 3* mRNA fragment of hPOSH (public gi:7959248; SEQ ID NO:6) 

atttcatatgttgagtttaactcggctgctaagcagctgatagaatgggataagcctcctgtgccaggag 
ttgatgctggagaatgttcctcggcagcagcccagagcagcactgccccaaagcactccgacaccaagaa 
gaacaccaaaaagcggcactccttcacttccctcactatggccaacaagtcctcccaggcatcccagaac 
cgccactccatggagatcagcccccctgtcctcatcagctccagcaaccccactgctgctgcacggatca 
gcgagctgtctgggctctcctgcagtgccccttctcaggttcatataagtaccaccgggttaattgtgac 
cccgcccccaagcagcccagtgacaactggcccctcgtttactttcccatcagatgttccctaccaagct 
gcccttggaactttgaatcctcctcttccaccaccccctctcctggctgccactgtccttgcctccacac 
caccaggcgccaccgccgctgctgctgctgctggaatgggaccgaggcccatggcaggatccactgacca 
gattgcacatttacggccgcagactcgccccagtgtgtatgttgctatatatccatacactcctcggaaa 
gaggatgaactagagctgagaaaaggggagatgtttttagtgtttgagcgctgccaggatggctggttca 
aagggacatccatgcataccagcaagataggggttttccctggcaattatgtggcaccagtcacaagggc 
ggtgacaaatgcttcccaagctaaagtccctatgtctacagctggccagacaagtcggggagtgaccatg 
gtcagtccttccacggcaggagggcctgcccagaagctccagggaaatggcgtggctgggagtcccagtg 
ttgtccccgcagctgtggtatcagcagctcacatccagacaagtcctcaggctaaggtcttgttgcacat 
gacggggcaaatgacagtcaaccaggcccgcaatgctgtgaggacagttgcagcgcacaaccaggaacgc 
cccacggcagcagtgacacccatccaggtacagaatgccgccggcctcagccctgcatctgtgggcctgt 
cccatcactcgctggcctccccacaacctgcgcctctgatgccaggctcagccacgcacactgctgccat 
cagtatcagtcgagccagtgcccctctggcctgtgcagcagctgctccactgacttccccaagcatcacc 
agtgcttctctggaggctgagcccagtggccggatagcgaccgttctccctggactccccacatctcctg 
acagtgcttcatcagcttgtgggaacagttcagcaaccaaaccagacaaggatagcaaaaaagaaaaaaa 
gggtttgttgaagttgctttctggcgcctccactaaacggaagccccgcgtgtctcctccagcatcgccc 
accctagaagtggagctgggcagtgcagagcttcctctccagggagcggtggggcccgaactgccaccag 
gaggtggccatggcagggcaggctcctgccctgtggacggggacggaccggtcacgactgcagtggcagg 
agcagccctggcccaggatgcttttcataggaaggcaagttccctggactccgcagttcccatcgctcca 
cctcctcgccaggcctgttcctccctgggtcctgtcttgaatgagtctagacctgtcgtttgtgaaaggc 
acagggtggtggtttcctatcctcctcagagtgaggcagaacttgaacttaaagaaggagatattgtgtt 
tgttcataaaaaacgagaggatggctggttcaaaggcacattacaacgtaatgggaaaactggccttttc 
ccaggaagctttgtggaaaacatatgaggagactgacactgaagaagcttaaaatcacttcacacaacaa 
agtagcacaaagcagtttaacagaaagagcacatttgtggacttccagatggtcaggagatgagcaaagg 
attggtatgtgactctgatgccccagcacagttaccccagcgagcagagtgaagaagatgtttgtgtggg 
ttttgttagtctggattcggatgtataaggtgtgccttgtactgtctgatttactacacagagaaacttt 
tttttttttttaagatatatgactaaaatggacaattgtttacaaggcttaactaatttatttgcttttt 
taaacttgaacttttcgtataatagatacgttctttggattatgattttaagaaattattaatttatgaa 
atgataggtaaggagaagctggattatctcctgttgagagcaagagattcgttttgacatagagtgaatg 
cattttcccctctcctcctccctgctaccattatattttggggttatgttttgcttctttaagatagaaa 
tcccagttctctaatttggttttcttctttgggaaaccaaacatacaaatgaatcagtatcaattagggc 
ctggggtagagagacagaaacttgagagaagagaagttagtgattccctctctttctagtttggtaggaa 
tcaccctgaagacctagtcctcaatttaattgtgtgggtttttaattttcctagaatgaagtgactgaaa 
caatgagaaagaatacagcacaacccttgaacaaaatgtatttagaaatatatttagttttatagcagaa 
gcagctcaattgtttggttggaaagtaggggaaattgaagttgtagtcactgtctgagaatggctatgaa 
gegtcatttcacattttaccccaactgacctgcatgcccaggacacaagtaaaacatttgtgagatagtg 
gtggtaagtgatgcactcgtgttaagtcaaaggctataagaaacactgtgaaaagttcatattcatccat 
tgtgattctttccccacgtcttgcatgtattactggattcccacagtaatatagactgtgcatggtgtgt 
atatttcattgcgatttcctgttaagatgagtttgtactcagaattgaccaatteaggaggtgtaaaaat 
aaacagtgttctcttctctaccccaaagccactactgaccaaggtctcttcagtgcactcgctccctctc 
tggctaaggcatgcattagccactacacaagtcattagtgaaagtggtcttttatgtcctcccagcagac 
agacatcaaggatgagttaaccaggagactactcctgtgactgtggagctctggaaggcttggtgggagt 
gaatttgcccacaccttacaattgtggcaggatccagaagagcctgtctttttatatccattccttgatg 
tcattggcctctcccaccgatttcattacggtgccacgcagtcatggatctgggtagtccggaaaacaaa 
aggagggaagacagcctggtaatgaataagatccttaccacagttttctcatgggaaatacataataaac 
cctttcatctttttttttttcctttaagaattaaaactgggaaatagaaacatgaactgaaaagtcttgc 
aatgacaagaggtttcatggtcttaaaaagatactttatatggttgaagatgaaatcattcctaaattaa 
ccttttttttaaaaaaaaacaatgtatattatgttcctgtgtgttgaatttaaaaaaaaaaaatacttta 
cttggatattcatgtaatatataaaggtttggtgaaatgaactttagttaggaaaaagctggcatcagct 
t tea t c tgtg taag t tgacaccaat gtgtca taa t a t 1 c 1 1 tat 1 1 tgggaaat tag tgtat 1 1 1 a taaa 
aattttaaaaagaaaaaagactactacaggttaagataatttttttacctgtcttttctccatattttaa 
gctatgtgattgaagtacctctgttcacagtttcctggtataaagttggttaaaatttcatctgttaata 
gatcattaggtaatataatgtatgggttttctattggttttttgcagacagtagagggagattttgtaac 
aagggcttgttacacagtgatatggtaatgataaaattgcaatttatcactccttttcatgttaataatt 
tgaggactggataaaaggtttcaagattaaaatttgatgttcaaacctttgt 



Figure 7: C teiminus protein fragment of hPOSH (public gi:7959249; SEQ ID 
NO:7) 

I SYVEPNSAAKQLIEWDKPPVPGVDAGEC^SAAAQSSTAPKHSDTKKNTKKRHS FTSLTMANKSSQASQN 
5 RHSMEISPPVLISSSNPTAAARISEI^GLSCSAPSQVHISTTGIiIVTPPPSSPVTTGPSFTPPSDVPYQA 
ALGTLNPPLPPPPLIJ\ATVIjASTPPGATAAAAAA^ 
EDELEIiRKGEMFLVFERCQDGWFKGTSMHTSKIGVFPGI 
VSPSTAGGPAQKLQGNGVAGSPSWPAAWSAAHIQTSPQAKVLIiH^^ 

PTAAVT P I QVQNAAGLSPAS VGLSHHSLAS PQ PAPLMPGSATHTAAI S I SRASAPLACAAAAPLTSPS IT 
10 SASLEAEPSGRIVTVLPGIiPTSPDSASSACGNSSATKPDKDSKKEKKGLLKLIiSGASTKRKPRVSPPASP 
TLEVELGSAEL PLQGAVGPELPPGGGHGRAGS CPVDGIX5PVTTAVAGAAIAQDAFHRKAS SLX)S AVPIAP 
PPRQACSSLGPVLNESRPWCERHRVWSYPPQSEAELELKEGDIVFVHKKREDGWFKGTLQRNGKTGLF 
PGSPVENI 
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Figure 8: Human POSH full mRNA, Annotated Sequence 
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gi 1 10432611 | dbj | AK021429 . 1 1 AK021429 Homo sapiens cDNA 

PLJ11367 fis, clone HEMBA1000303 , highly similar to Mus musculus 
Plenty of SH3s (POSH) mRNA 

gi | 7959248 | dbj | AB040927 . 1 1 AB040927 Homo sapiens mRNA for 

KIAA1494 protein, partial cds 

- Both hPOSH and KIAA1495 

Ring Domain 

SH3 Domian 

■ start codon and stop codon of predicted ORP 
CTGAGAGACACTGCGAGCX^CGAGCGCGGTGGGG 

CGCGAACAAAGAGGAGGAGCCGAGGCGCGAGAGCAAAGTCTGAAATGGAT^ 
GGATGCACACAACTATGAAC^TTTCTGAAGATTTTTTCTCAGTAAAGTAGATA 
CTTGTTGGATCTTTTGGAC 



GCAGGACTCTTGTTGGCTCGGGTGTCGAGGAGCTTCCCAGTAACATCTTGCTGGTC^ 
CATCAAACAGAGGCCTTGGAAACCTGGTCCTGGTGGGGGAAGTGGGACCAACTGCA 
TCTCAGAGCAGCACTGTGGCTAATTGTAGCTCAAAAGATCTGC^ 
GGGTGCAATCCTGGAGCCCCCCAGTGAGGGGTATACCTCAGTTA 




AAACX3GAAGCCCCGCGTGTCTCCTC(^GCATa5CC« 
CTCTCCAGGGAGCGGTGGGGCCCGAACTGCCArc 
GGACGGGGACGGACCGGTCACX^CTGCAGTGGCAGGAGCAGCCCT^ 
GCAAGTTCCCTGGACTCCGCAGTTCCCATCGCTCCACCTCCTCGCCAGGCCT 
TCTTGAATGAGTCTAGACCTGTCGTTTG1 



60 



^raTTTACAAGGCTTAACTAATCTATTTGCTT 

TGAGAGCMGAGATTCGTTTXGACATAGAGTGAATGCATTTTCCCCTCTCCTCCTC^^ 

aaccaaacaxacaaatgaatcagtatcaattagggcctggggtagagagacagaaacttga^^Sagaga 

^™™S^ TCT ^ CTAG ^^^ 



^G^S^ar™ TGTC ^ G ^^ TAT ^^CArTTCACATTTTACCCCAACTG^CTG« 



™^™^ CTCTCAAAAGra ^ TA ^CATCCATOGTGATTCTTTCCCCA^^^ 
™™^^ TAATATAGACTG ^^ 

™™™ GA ^ TOCAGGAOT ^ 

^™^n^^ GTG ^ CTCGCTCCCTCTCTGGCTAAGG ^^ T ^CCAC^C^ 
™ AG I GAAAG I GGTCT ™ TATCTOT ^ 

CTGTGACTGTGGAGCTCTG(^GGCTTCMTGGGAGTCAATTTGCCCACACCT 

^™ CTCTCT ^ ATATCCATOCOTGATGTCA "^CCTC^C^^^ 

CACGCAGTCATGGATCTGGGTAGTCCGGAAAACAAAAG^^ 

^™°ir TCTCATGGGWTACATAATAAACCC " 

TCCTGTG1X3XTGAATTTAAAAAAAAAAAATACTTTACTTGGATATTCATGTAATATATAA 
— It 

GMAATTTTTTTACCTOTCTTTTCTCCATATTTTAAGCTATG^ 



Figure 9: Domain Analysis of Human POSH 
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Figure 10: Diagram of Human POSH Nucleic Acids 
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Figure 1 1 : Reduction in Full Length POSH mRNA by siRNA Duplexes 
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Figure 12: POSH Affects Release of VLP from Cells 



(WT) 



(Posh+WT) 



Figure 13: Release of VLP from Cells at Steady State 



Figure 14: Mouse POSH mRNA sequence (public gi:10946921; SEQ ID NO: 8) 

GGGC^GCGGGCTCGGCGGGGCTGCATCTACCAGCGCTGCGGG^ 
GCGAGAGCAAAGTCTGAAATGGJVTGTTACATGAATCACTTT^ 
5 AAGCCGTTTTCTCACTAAAGTC^CTC^GATGGATGAGTCTGCCTTGTTGGACCTTCTGGAGTGCCCTGT 
GTGTCTAGAACGCCTGGATGCTTCCGCAAAGGTCTT^^ 
CTGGGGATTGTGGGTTCCCGG^TGAACTC^GATC^ 
ACGAGCTCCCCAGTAACATCCTACTGGTCAGACTTCTGGAT 
CCCTG^TGGGGGCGGCGGGACCACCTGCTVCAAACACATTAAGGGCGC^ 
1 0 GGCTCGAAAGATCTGCAGAGCTCCCAGTGTGGACAGCAGCCTCGGGTGCAAGCCTGGAGC 
GGGGAATACCTCAGTTACCGTGTG CCAAAGCATTATAT^ 

GTTCAGCAAAGGCGACACCATGATTCTGCGCCGACAGGTGGATGAGAATTGGTA CCACGGGGAAGT CAGC 

GGGGTCCACGGCTTTTTCCCCACTAACTTCGTGCAG^ 

GCAAAGCACTTTACGACTTTGAAGTGAAAGACAAGGAAGCn'GA 
1 5 CGACGTACTGACCGTGATCCGCAGAGTGGATGAAAACTGGGCTGAAGG^TGCTGGCAGATAAAATAGGA 

ATATTTCCAATTTCATA CGTGGAGTTTAACTCAGCTG CCAAGCAG C TGATAGAGTGGGATAAGCCTCCCG 

TGCCAGGAGTGGAC^O^CAGAATGCCCCTCAGCGA 

CACGAAGAAGAACACCAGGAAGCGACACTCCTTCACCTCCCT 

TCCCAGAACCG^CACTCCATGGAGATC&GCCCTCCTGTGCT 
20 CCCGCATCAGCGAACTGTCCGGGCTCTCCTGCAGCGCCCCGTCT 

AATTGTGACCCCACCCCCTAGCAGCCCGGTGACAACTGGC^ 

TACCAAGCTGCCCTTGXSAAGTATGAATCCTCC^^ 

CCTCCACCCCXJTCAGGCGCTACTGCTGCTGTTGCTGCTGCTGCTGCCGCCGCC^ 
ACCCAGGCCTGTGATGGX^TCCTCTGAACAGATTGCACAT^ 
25 GTTGCTATATATCCGTACACTCCCCGGAAGGAAGACGAACTGGAGCTGAGGAAAGGGGAGATGTTTTTGG 
TGTTTGAGCGTTGCCAGGACGGCTGGTACAAAGGGACATCGATGCATACCAGCAAGATAGGCGTTTTCCC 
TGGCAACTATGTGGCG^CCGTCACAAGGGCGGTGACXS^ 
GCGG^TCAGGCAAGTCGCGGGGTGACCATGCTCAGCCCT 

AAGGAAA03GCGTGGCCGGAAATCCCAGCGTCGTCCCCACX3GCTGTGGTGTCAGCAGCT 
30 AAGTCCTCAGGCTAAGGTCCTJGCTCCACA 

AGGACAGTTGCAGCACATAGCCAGGAACGCCCCACAGCAGCACTG^ 

CCTGCCTTGGTCCTG<IATCCGTGGGCCTGCCCCATCATTCTCTGGCCT 

GGGTCCTX3CTGCCCACGGTGCTGCCGTCAGCATCAGTCG 

GCTTCTCTGGCCTCCCCAAATATGACCAGTGCCATGTTG^ 
3 5 TCCTCCCTGGACTCCCCACATCTCCAGAGAGTGCTGCATC^ 

AGACAAGGACAGTAAGAAAGAAAAAAAGGGCCTACTGAAGCTGCTTTCTGGTGGCTCC^ 

CCCCGAGTCTCCCCTCCAGCATCACCrACC 

GAGC^GTAGGTCCTGAGCTGCCGCTAGGGGGCAGCCAQGGCAGAGTGG^ 
TGGTCCAGTGGCCGCTGK3AACAGCAGCCCTAGC^ 
40 TCCGCAGTGCCCATTGCTCC^CCACCT^ 

GGCCTGTTGTTTGTGAAAGGCACAGGGTGGTGGTTTCCTACCCTCCTCAGAGTGAGK3CCGAACT 

AATGGGAAGACTGGCCTTTTCCC^GGGAGCTTTGTGGAAAACATCTGAGA 
TTATC^TCACACCACGTGTGACTAAAGAGCACAAAGC^ 
45 AGATCTTCAAGAACCGAG^GAAGATGG^CACCTGACTC 

AGG^AAGGAGGAC^CACCTGTGTGGGTTCCGTCTCTCTGGGTTCTGATGTGTAAAGTCT 

T CTAATGGACTTTACAGATAAATGTC , ri''rr XTTTTT TAAGATGTATAACIAAAATGGACAATTGTTTACA 

AGGCTTAACTAATTTATTTGCTTTTTTAAAACT^ 



Figure 15: Mouse POSH Protein sequence (Public gi: 10946922; SEQ ID NO: 9) 

MDESALLDIiLECPVCLERLDASAKVLPCQHTFCKRCIiLGIVGSRNELRCPECRTLVGSGVDEIiPSNILLV 
RliLDGIKQRPWKPGPGGGGGTTCTNTIiRAQGSTVVNCGSKDLQSSQCG^ 
5 AIjYNYEGKEPGDLKFSKGDTI IliRRQ^nDENWYHGEVSGVHGFFPTNFVQI I KPLPQPPPQCKALYDFEVK 
DKEADKDCLPFAKDDVI*TVIRRVDEWV3S^GMIiADKIGI FPI S YVEFNSAAKQLIEWDKPPVPGVDTAECP 
SATAQSTSASKHPDTKKNTRKRHSFTSLTMANKSSQGSQimHSMEISPPVLISSSNPTAAARISELSGLS 
CSAPSQVHISTTGLIVTPPPSSFVTTGPAFTFPSDVPYQAAIX^ 

VAAAAAAAAAAGMGPRPVMGS S EQI AHLRPQTRPSVYVAI Y P YT PRKEDELELRKGEMFLVFERCQDGWY 
1 0 KGTSMirrSKIGVFPGNYVAPVTRAVTNASQAKVSMS TAGQASRGVTMVS PSTAGGPTQKPQGNGVAGNPS 
WPTAWSAAHI QT SPQAKVLLHMS GQMTVNQARNAVRTVAAH SQERPTAAVT P I QVQWAACLGPASVGL 
PHHS IiASQPLPPMAGPAAHGAAVS I S RTNAPMACAAGASLAS PNMTSAMIiETE PSGRTVTIIiPGLPTS PE 
S AAS ACGNSSAGKPDKDS KKEKKGLLKLLS GAS TKRKPRVS PPAS PTLDVELGAGEAPLQGAVGPEIiPLG 
GSHGRVGSCPITDGDGPVAAGTAAIAQDAFHRKXSSLDSAVPXAPP^^ 
1 5 WS YPPQSEAELELKEGDIVFVHKKREDGWFKGTLQRNGKTGIiFPGSFVENI 



Figure 16: Drosophila melanogaster POSH mRNA sequence (public gi: 17737480; 
SEQIDNOrlO) 



5 CATTTGTATCCGCTTGG CCACGAG CTTTGGCTG CACTTGG CAAACTT AATAAATTAAACATTGAATCCTG 
CCTATTGCAACGATAATATAATCTGATTTAGTGCAT^ 
TTAGCATTTGAGCTAAATTTATTTCCCAACCGC^^ 

GTGTGTGTGTTTTGGAATGTGGCCCTGCACX3AAATTCAAATAGTGACCATCCCT 

GCAAGATGGACGAGCACACGTTAAACGACCTGTTGGAGTG CT CCGTGTGTCTTGAGCGACTGGACACCAC 
1 0 ATCGAAGGTGCTGCCATGCCAGCACACCTTCTGCCGCAAATGCTTGCAGGACATTGTGGCCAGTCAGC^C 
AAGTTGCGATGCCCGGAGTGCCGCATCCTGGTCTCTTC 

TGATGCGAATCTTAGAAGGCATGAAACAAAATGCAGCAGCTGGCAAAGGAGAAGAAA^ 
TGAAACACAGCCGGAAAGGGCCAAACCTC^GCCGCCAGCGGAATCAGTGGCCCCGCOT 
CTCCAGCTGCAGTCACATCAGCAATCTCATCAGC(X3GCTro 

1 5 ACGCCTATGCCCTCTTTGACTTCGCCTCCGGTGAAGCCZACCGATCTAAAGTTCAAGAAAGGGGATCTGAT 
ACTGATCAAGCATCGCATCGACAACAACTGGTTTGT 
ATCAACTACGTCAAGGTATCGGTTCCGCTGCCCATGCCGCAGTC 
GGCCCAACGACGAGGAGGGATGCCTCGAATTTAAGAA 
TCATAATTGGGCAGAAGGACGAATTGGCCAGACCATCGGAATCT 

20 GCAGCGGCCAAAAAGCTGTTGGACAGQSGGCTACACACCCATCC^ 

GGCAGCGGGCCCTTCCTCCX3GTTCCAGTTATTGATCCCACGGTGGTCACGGAATCCAGTTCGGGATCCTC 
CAATTCCACGCCGGGCAGCAGCAATTCAAGCTCCAC^TCCAGCTCGAATAACTGCAGTCCGAATCACC^ 
ATCTCACTGCCGAATACC CC CCAACATGT AGTAGCT TC CGGATCGG CGTCTGTT CGTTT CCGTGACAAGG 
GAGOVAAGGAGAAATOCCACTCACTAAATGCTTTGCTGGGAGGAGGAGCTCCATTAA 

25 C^CO^mTTCGGCTGAAATTCTTAGCCTGCCCCATGAACTAAGCCGCTTGGAAGTTTCC^GCTCAA^ 
GCTCTAAAACCX^CGTCAGCCCCACAGACATCGCGTGTACTTAAGACCACTGTTCAGC^ 
CGAATTTAC CCTGGGGATACTTAGC CCTGTTCCCATACAAACCACG CCAAACGGATGAGCTGGAATTAAA 
AAAGGGTTGTGT TT ACATTGTGACCGAA CGATGTGTGGAGGGTTGGTTCAAGGGAAAAAACTGGTTGGAC 

30 AATGGAAATATGTTCCC CAAAATGCAGA CX5CCCAGATGGCACAAGTACAGCAGCATCCAGTTGCACCAGA 

TGTGCGACTCAACAACATGCTGTCCATGCAACCGCCTGATTTGCCACCTCGTCAG 

ACGACCACC^GTTGCTCTGTGTGGTCGAAACCAGTGGAGGQGCTC 

CTGAAACTG CCACAG CTTCGACTACGAGCAGCAGTTCCTCTGGAGCAGT 

TCACATGAAAACACGCTCCAAATCTCCGGGAGCGTC 
35 AATGTGGAATTTACAACAAACCCATCAGCTAAATTGCATCCAGTACATGXAAGATCCGGCTTO 

GTCAGCTGCAGCACAGTCAACCGCTCAATGAAACTCC^ 

CCTACCCAAGCAGCTGCCTTCCGCTTCTACGAAC^GCGTTTCGTACGGATCGCA 
AAGGAACGTCCTC^CTTGATTTGCGCGAGACAATCATra 

ATGCCGCGTCGCCGCCX5CCACCTACTACTTCC^TGGCCCCAGCTGTCTACGCCGGCGGTCAGCAACAGGT 
40 GATTCCTGGAGGTGGAGCGCAATCCC^GTTGCATGCCAAT^ 

CACAGCCTAGATGCGAGTCATGTGCTGAGTCCCAGCAGCAATAT^^ 
GCGCCACC^CTAAGTCTCCTTACTGCACGAGGGAAAGT 

CAGTGACATTGAACTAGAGCTAC^TTTGGGCGACATTATCTACGTCCAGCGGA 
TATAAGGGCACCCATGCCCX3TACCCACAAAACCGGGCTGTTCCC 
45 AGGAAAGTTATGGTTCAAACTAGAATITATTAAGCGAAAT^ 
GTCGGTCTATTCGGGCTTCCAAATACGCAATCT^ 

ctt ccaatcgaatg ggcag ctcgccgttgtactttttt^ 

taagacttagggaacagttacttaagccttagcgattagttagctagagaaataatctaac 
tgccctctacaaagttatttgtaatatacgatact^^ 



Figure 17: Drosophila melanogaster POSH protein sequence (public girl 7737481; 
SEQIDNO:ll) 

OTEMTLNDLLECSVCLERLDTTSKVIiPC^HTFCRKC^ 
5 RILEGMKQNAAAGKGEEKGEBTETQPERAKPQPPAESVAPPDNQLLQLQSHQQSHQPARHKQRRFLLPHA 
YALFDFASGEATDLKFKKGDLILIKHRIDNNWFVG 
NDEEGCLEFKKSTVIQVMRRVDHNWAEGRIGQTIGIFPIAF^ 

RALPPVPVIDPTVVTBSSSGSSNSTPGSSNSSSTSSSNNCSPNHQISLPNTPQHWASGSASVRFRDKGA 
KEKRHS I^KfiJjLGGGAPLSLLQTNRHSAEI LSLPHELS RLEVS SSTALKPTSAPQTSR 
10 LPWGYIiALFPYKPRQTDELEIjKXGC\ryiVTERC\/IX3WFKGKNW 

KYVPQNADAQMAQVQQH PVAPDVRLNNMLSMQPPDLPPRQQQATATTTS CS VWS KPVEAIaFSRKS EPKPE 
TATASTTSS S SSGAVGLMRR1»THMKTRS KS PGASLQQVPKEAI S TNVEFTTNPSAKLHPVHVRSGS CPSQ 
IjQHSQPIiNETPAAKTAAQQQQFLPKQLPSASTNSVSYGSQRW 

ASPPPPTTSVAPAVYAGGQQQVI PGGGAQSQLHANMI IAPSHRKSHSLDASHVLiSPSSNMITETVAIKASA 
1 5 TTKSPYCTRESRFRCIVPYPPNSDI ELELHLGDI IYVQRKQKNGWYKGTHARTHKTGLFPASFVEPDC 



Figure 18: POSH Domain Analysis 
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N terminus protein fragment of hPOSH (public gi:io4326i2): 
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C terminus protein fragment of hPOSH (public gi:79S9249): 
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Mouse POSH Protein sequence (Public gi: 10946922): 




Drosophila melanogaster POSH protein sequence (public gi:l773748i) 



Figure 19 
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Figure 20. 



L -Domain motif wild-type PPEY ATAP 

ChaSe (Tl) DM 41 llt<t 0I1 48 0I24B 0114801148 



Figure 21 
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Figure 22: Human POSH has ubiquitin ligase activity 
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Figure 23 




Figure 24. Knock-down of POSH entraps HTV virus particles in intracellular 
vesicles 



Figure 25. POSH is localized at two sites: one at the nuclei and one at the golgi (C). 
After HIV transfection recruitment of POSH to the golgi is enhanced (D). 




Figure 26 



293T cells transfected with: 
pNLenv-1 (WT HIV plasmid) 
HA-Ubiquitin 

POSH plasmids 



Extract prepared by Triton X-100 extraction at 30°C 



Immunoprecipitation of extracts from 10 cm dishes 
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Immunoblot with anti-HA 
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Figure 27 
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